Commit 085d475 by sagawa
1 Parent(s): b678cd1

Update README.md

Files changed (1)
  1. README.md +12 -13
@@ -53,27 +53,26 @@ output # 'CN1CCC=C(CO)C1'
  ### Training Procedure
 
  <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
- We used the Open Reaction Database (ORD) dataset for model training.
+ We used the [USPTO_MIT dataset](https://yzhang.hpc.nyu.edu/T5Chem/index.html) for model finetuning.
  The command used for training is the following. For more information, please refer to the paper and GitHub repository.
 
  ```python
- python train_without_duplicates.py \
- --model='t5' \
- --epochs=100 \
- --lr=1e-3 \
+ cd task_forward
+ python finetune.py \
+ --output_dir='t5' \
+ --epochs=50 \
+ --lr=2e-5 \
  --batch_size=32 \
- --input_max_len=150 \
- --target_max_len=100 \
- --weight_decay=0.01 \
+ --input_max_len=200 \
+ --target_max_len=150 \
  --evaluation_strategy='epoch' \
  --save_strategy='epoch' \
  --logging_strategy='epoch' \
- --train_data_path='/home/acf15718oa/ReactionT5_neword/data/all_ord_reaction_uniq_with_attr20240506_v3_train.csv' \
- --valid_data_path='/home/acf15718oa/ReactionT5_neword/data/all_ord_reaction_uniq_with_attr20240506_v3_valid.csv' \
- --test_data_path='/home/acf15718oa/ReactionT5_neword/data/all_ord_reaction_uniq_with_attr20240506_v3_test.csv' \
- --USPTO_test_data_path='/home/acf15718oa/ReactionT5_neword/data/USPTO_MIT/MIT_separated/test.csv' \
+ --save_total_limit=10 \
+ --train_data_path='../data/USPTO_MIT/MIT_separated/train.csv' \
+ --valid_data_path='../data/USPTO_MIT/MIT_separated/val.csv' \
  --disable_tqdm \
- --pretrained_model_name_or_path='sagawa/ZINC-t5'
+ --model_name_or_path='sagawa/ReactionT5v2-forward'
  ```
 
  ### Results
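
For anyone who wants to script the updated fine-tuning command rather than paste it into a shell, the sketch below assembles the same argument list in Python. The flag names and values are copied from the added and unchanged lines of the hunk above. The helper `build_finetune_command` and the `FINETUNE_ARGS` dictionary are hypothetical names introduced here for illustration, not part of the repository.

```python
import shlex

# Flags copied from the new side of the diff (added and context lines).
FINETUNE_ARGS = {
    "output_dir": "t5",
    "epochs": 50,
    "lr": "2e-5",
    "batch_size": 32,
    "input_max_len": 200,
    "target_max_len": 150,
    "evaluation_strategy": "epoch",
    "save_strategy": "epoch",
    "logging_strategy": "epoch",
    "save_total_limit": 10,
    "train_data_path": "../data/USPTO_MIT/MIT_separated/train.csv",
    "valid_data_path": "../data/USPTO_MIT/MIT_separated/val.csv",
    "model_name_or_path": "sagawa/ReactionT5v2-forward",
}


def build_finetune_command(args, script="finetune.py"):
    """Hypothetical helper: return the argv list for the fine-tuning script."""
    command = ["python", script]
    command += [f"--{name}={value}" for name, value in args.items()]
    command.append("--disable_tqdm")  # boolean flag, takes no value
    return command


command = build_finetune_command(FINETUNE_ARGS)
print(shlex.join(command))  # shell-quoted form of the command, for inspection
```

The list is only built and printed here, not executed. Assuming a local checkout of the repository, it could be passed to `subprocess.run(command, cwd="task_forward", check=True)`, which mirrors the `cd task_forward` step in the README.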