ELYZA-japanese-Llama-2-MoE-2x13B-v0.1 / mergekit_moe_config.yml
Aratako
upload model
a2eff46
raw
history blame
317 Bytes
base_model: ./ELYZA-japanese-Llama-2-13b-instruct
gate_mode: random
dtype: bfloat16
experts:
- source_model: ./ELYZA-japanese-Llama-2-13b-instruct
positive_prompts: []
- source_model: ./ELYZA-japanese-Llama-2-13b
positive_prompts: []
tokenizer_source: model:./ELYZA-japanese-Llama-2-13b-instruct