sagawa
/

ReactionT5v2-forward-USPTO_MIT

Model card Files Files and versions Community

sagawa commited on Sep 9

Commit

085d475

•

1 Parent(s): b678cd1

Update README.md

Files changed (1) hide show

README.md +12 -13

README.md CHANGED Viewed

@@ -53,27 +53,26 @@ output # 'CN1CCC=C(CO)C1'
 ### Training Procedure
 <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
-We used the Open Reaction Database (ORD) dataset for model training.
 The command used for training is the following. For more information, please refer to the paper and GitHub repository.
 ```python
-python train_without_duplicates.py \
-    --model='t5' \
-    --epochs=100 \
-    --lr=1e-3 \
     --batch_size=32 \
-    --input_max_len=150 \
-    --target_max_len=100 \
-    --weight_decay=0.01 \
     --evaluation_strategy='epoch' \
     --save_strategy='epoch' \
     --logging_strategy='epoch' \
-    --train_data_path='/home/acf15718oa/ReactionT5_neword/data/all_ord_reaction_uniq_with_attr20240506_v3_train.csv' \
-    --valid_data_path='/home/acf15718oa/ReactionT5_neword/data/all_ord_reaction_uniq_with_attr20240506_v3_valid.csv' \
-    --test_data_path='/home/acf15718oa/ReactionT5_neword/data/all_ord_reaction_uniq_with_attr20240506_v3_test.csv' \
-    --USPTO_test_data_path='/home/acf15718oa/ReactionT5_neword/data/USPTO_MIT/MIT_separated/test.csv' \
     --disable_tqdm \
-    --pretrained_model_name_or_path='sagawa/ZINC-t5'
 ```
 ### Results

 ### Training Procedure
 <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+We used the [USPTO_MIT dataset](https://yzhang.hpc.nyu.edu/T5Chem/index.html) for model finetuning.
 The command used for training is the following. For more information, please refer to the paper and GitHub repository.
 ```python
+cd task_forward
+python finetune.py \
+    --output_dir='t5' \
+    --epochs=50 \
+    --lr=2e-5 \
     --batch_size=32 \
+    --input_max_len=200 \
+    --target_max_len=150 \
     --evaluation_strategy='epoch' \
     --save_strategy='epoch' \
     --logging_strategy='epoch' \
+    --save_total_limit=10 \
+    --train_data_path='../data/USPTO_MIT/MIT_separated/train.csv' \
+    --valid_data_path='../data/USPTO_MIT/MIT_separated/val.csv' \
     --disable_tqdm \
+    --model_name_or_path='sagawa/ReactionT5v2-forward'
 ```
 ### Results