gbyuvd/chemfie-gpt-experiment-1


gbyuvd committed on Aug 15

Commit 4f592de • 1 Parent(s): 236eab7

Update README.md

Files changed (1)

README.md +2 -1


@@ -80,8 +80,9 @@ C(CCCCCCCCO)=CCC=C
 ## Training Procedure
 - **Batch Size**: 64
+- **Num Epoch for Each Chunk**: 1
 - **Learning Rate**: 1.5e-5
-- **Optimizer**: Ranger21 (MADGRAD-Lookahead-AdaBelief with gradient centralization, gradient clipping, and weight decay)
+- **Optimizer**: Ranger21 (MADGRAD-Lookahead-AdaBelief with gradient centralization, linear warm up (22%), gradient clipping, and L2 weight decay)
 ## Training Logs
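
The "linear warm up (22%)" entry in the updated optimizer line can be illustrated with a small sketch. This is not the Ranger21 implementation or the author's training code. It assumes that 22% refers to the fraction of total optimizer steps spent ramping up, and that the rate holds at the README's 1.5e-5 afterwards. The function name `warmup_lr` and its parameters are hypothetical.

```python
def warmup_lr(step, total_steps, base_lr=1.5e-5, warmup_frac=0.22):
    """Learning rate at a zero-based step under linear warm up.

    Assumption: warmup_frac is the share of total_steps used for the ramp.
    """
    # Number of steps in the ramp; at least one to avoid division by zero.
    warmup_steps = max(1, round(total_steps * warmup_frac))
    if step < warmup_steps:
        # Scale linearly from base_lr / warmup_steps up to base_lr.
        return base_lr * (step + 1) / warmup_steps
    # After the ramp, hold the base rate (any later decay is not modeled here).
    return base_lr


if __name__ == "__main__":
    for s in (0, 110, 219, 220, 999):
        print(s, warmup_lr(s, total_steps=1000))
```

With 1000 total steps, the ramp in this sketch covers the first 220 steps, and every later step uses the full base rate.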