---
datasets:
- JeanKaddour/minipile
language:
- en
license: mpl-2.0
---

# RWKV Minipile

## Model Specifications

- **Architecture**: RWKV
- **Vocabulary Size**: 65,536
- **Embedding Size**: 768
- **Number of Layers**: 12
- **Context Length**: 512
- **Data Type**: bfloat16
- **Dataset**: Minipile
- **Tokens**: 20,643,840 (about 20.6 million)

The model was trained for 30 epochs.

## Inference

```bash
pip install torch numpy
python inference.py
```
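
Before running inference, the figures listed under Model Specifications can be sanity-checked with a little arithmetic. The sketch below is not part of the repository. It assumes that the listed token count is the amount of data seen in one epoch, and it counts only the token embedding matrix, not the full model. All constant and function names are illustrative.

```python
# Values copied from the Model Specifications list.
VOCAB_SIZE = 65_536
EMBEDDING_SIZE = 768
CONTEXT_LENGTH = 512
DATASET_TOKENS = 20_643_840
EPOCHS = 30
BYTES_PER_BF16 = 2  # bfloat16 stores each value in 16 bits


def full_context_windows(tokens, context_length):
    """Return (number of full windows, leftover tokens)."""
    return divmod(tokens, context_length)


windows, leftover = full_context_windows(DATASET_TOKENS, CONTEXT_LENGTH)

# Assumption: DATASET_TOKENS is the data seen in a single epoch.
tokens_seen = DATASET_TOKENS * EPOCHS

# Token embedding matrix only; the 12 RWKV layers and the output head
# are not counted here.
embedding_params = VOCAB_SIZE * EMBEDDING_SIZE
embedding_mib = embedding_params * BYTES_PER_BF16 / 2**20

print(f"Full 512-token windows: {windows:,} (leftover tokens: {leftover})")
print(f"Tokens seen over {EPOCHS} epochs: {tokens_seen:,}")
print(f"Embedding parameters: {embedding_params:,} ({embedding_mib:.0f} MiB in bfloat16)")
```

The token count divides evenly into 40,320 windows of 512 tokens, which suggests the data was packed into fixed-length sequences, although the card does not state this.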