CombinHorizon commited on
Commit
7f03067
1 Parent(s): 6ceede4

Update README.md

Browse files

add merge details

Files changed (1) hide show
  1. README.md +34 -1
README.md CHANGED
@@ -14,4 +14,37 @@ ECE-TW3-JRGL-V3 is a merge of the following models using [mergekit](https://gith
14
  * [migtissera/Tess-72B-v1.5b](https://huggingface.co/migtissera/Tess-72B-v1.5b)
15
  * [abacusai/Smaug-72B-v0.1](https://huggingface.co/abacusai/Smaug-72B-v0.1)
16
 
17
- ## 🧩 Configuration
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
14
  * [migtissera/Tess-72B-v1.5b](https://huggingface.co/migtissera/Tess-72B-v1.5b)
15
  * [abacusai/Smaug-72B-v0.1](https://huggingface.co/abacusai/Smaug-72B-v0.1)
16
 
17
+ ## 🧩 Configuration
18
+
19
+ ```yml
20
+ base_model: migtissera/Tess-72B-v1.5b
21
+ dtype: bfloat16
22
+ merge_method: slerp
23
+ parameters:
24
+ t:
25
+ - filter: self_attn
26
+ value:
27
+ - 0
28
+ - 0.5
29
+ - 0.3
30
+ - 0.7
31
+ - 1
32
+ - filter: mlp
33
+ value:
34
+ - 1
35
+ - 0.5
36
+ - 0.7
37
+ - 0.3
38
+ - 0
39
+ - value: 0.5
40
+ slices:
41
+ - sources:
42
+ - layer_range:
43
+ - 0
44
+ - 80
45
+ model: migtissera/Tess-72B-v1.5b
46
+ - layer_range:
47
+ - 0
48
+ - 80
49
+ model: abacusai/Smaug-72B-v0.1
50
+ ```