kromvault
/

L3-Himerus-Basis.C-13B

@@ -1,82 +1,136 @@
----
-base_model:
-- Sao10K/L3-8B-Niitama-v1
-- Nitral-AI/Hathor_Tahsin-L3-8B-v0.85
-- ArliAI/ArliAI-Llama-3-8B-Formax-v1.0
-- nothingiisreal/L3-8B-Celeste-V1.2
-library_name: transformers
-tags:
-- mergekit
-- merge
----
-# merge
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-## Merge Details
-### Merge Method
-This model was merged using the passthrough merge method.
-### Models Merged
-The following models were included in the merge:
-* parts/celeniit14-20.sl
-* [Sao10K/L3-8B-Niitama-v1](https://huggingface.co/Sao10K/L3-8B-Niitama-v1)
-* [Nitral-AI/Hathor_Tahsin-L3-8B-v0.85](https://huggingface.co/Nitral-AI/Hathor_Tahsin-L3-8B-v0.85)
-* [ArliAI/ArliAI-Llama-3-8B-Formax-v1.0](https://huggingface.co/ArliAI/ArliAI-Llama-3-8B-Formax-v1.0)
-* [nothingiisreal/L3-8B-Celeste-V1.2](https://huggingface.co/nothingiisreal/L3-8B-Celeste-V1.2)
-### Configuration
-The following YAML configuration was used to produce this model:
-```yaml
-models:
-slices:
-- sources:
-  - layer_range: [0, 4]
-    model: Nitral-AI/Hathor_Tahsin-L3-8B-v0.85
-- sources:
-  - layer_range: [1, 5]
-    model: ArliAI/ArliAI-Llama-3-8B-Formax-v1.0
-- sources:
-  - layer_range: [4, 8]
-    model: Nitral-AI/Hathor_Tahsin-L3-8B-v0.85
-- sources:
-  - layer_range: [5, 9]
-    model: ArliAI/ArliAI-Llama-3-8B-Formax-v1.0
-- sources:
-  - layer_range: [8, 10]
-    model: Sao10K/L3-8B-Niitama-v1
-- sources:
-  - layer_range: [6, 14]
-    model: nothingiisreal/L3-8B-Celeste-V1.2
-- sources:
-  - layer_range: [0, 6]
-    model: parts/celeniit14-20.sl
-- sources:
-  - layer_range: [20, 23]
-    model: Sao10K/L3-8B-Niitama-v1
-- sources:
-  - layer_range: [22, 26]
-    model: Nitral-AI/Hathor_Tahsin-L3-8B-v0.85
-- sources:
-  - layer_range: [22, 28]
-    model: nothingiisreal/L3-8B-Celeste-V1.2
-- sources:
-  - layer_range: [25, 27]
-    model: Nitral-AI/Hathor_Tahsin-L3-8B-v0.85
-- sources:
-  - layer_range: [28, 30]
-    model: Sao10K/L3-8B-Niitama-v1
-- sources:
-  - layer_range: [25, 32]
-    model: nothingiisreal/L3-8B-Celeste-V1.2
-parameters:
-  int8_mask: true
-merge_method: passthrough
-dtype: bfloat16
-```

+---
+base_model:
+- Sao10K/L3-8B-Niitama-v1
+- Nitral-AI/Hathor_Tahsin-L3-8B-v0.85
+- ArliAI/ArliAI-Llama-3-8B-Formax-v1.0
+- nothingiisreal/L3-8B-Celeste-V1.2
+library_name: transformers
+tags:
+- mergekit
+- merge
+---
+![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/667eea5cdebd46a5ec4dcc3d/YkA856HMNfrjBxFUOkxtP.jpeg)
+1/3 of the 13B models for Horizon Himerus (will update with link later). This merge was orginally suppose to be only for that final model, but this guy is suprisangly competent.
+A tad jank, but very solid for what it is.
+### Quants
+[OG Q8 GGUF](https://huggingface.co/kromeurus/L3-Himerus-Basis.C-13B-Q8-GGUF) by me.
+Other quants are not available, yet.
+### Details & Recommended Settings
+(Still testing; nothing here is finalized.)
+Follows intructions fairly well for RP and eRP. Dramatic as fuck at times, depending on the senario. Human dialogue and lots of it.
+Rec. Settings:
+```
+Template: Model Default
+Temperature: 1.22
+Min P: 0.115
+Repeat Penelty: 1.05
+Repeat Penelty Tokens: 256
+```
+### Models Merged & Merge Theory
+The following models were included in the merge:
+* [Sao10K/L3-8B-Niitama-v1](https://huggingface.co/Sao10K/L3-8B-Niitama-v1)
+* [Nitral-AI/Hathor_Tahsin-L3-8B-v0.85](https://huggingface.co/Nitral-AI/Hathor_Tahsin-L3-8B-v0.85)
+* [ArliAI/ArliAI-Llama-3-8B-Formax-v1.0](https://huggingface.co/ArliAI/ArliAI-Llama-3-8B-Formax-v1.0)
+* [nothingiisreal/L3-8B-Celeste-V1.2](https://huggingface.co/nothingiisreal/L3-8B-Celeste-V1.2)
+So you're not suppose to mix models with different trained context limits, but I did it anyway. Wanted the 'human' output of Celeste v1.2 while curbing the repitition and adding
+some back up from Niitama and Hathor Tahsin. Formax was included in the beginning for it's instruct following.
+Took a page out of [@matchaaaaa](https://huggingface.co/matchaaaaa)'s Chaifighter Latte and took out a slice of Celeste and Nittama in the center for smoothing out layer disparity.
+I realized while testing that using that 'splice' metheod, you could theoretically make a pretty big model then squish it down to streamline the layers. So, after much testing,
+I came up with the following merges.
+### Config
+```yaml
+models:
+slices:
+- sources:
+  - layer_range: [14, 20]
+    model: nothingiisreal/L3-8B-Celeste-V1.2
+parameters:
+  int8_mask: true
+merge_method: passthrough
+dtype: bfloat16
+name: celeste14-20.sl
+---
+models:
+slices:
+- sources:
+  - layer_range: [14, 20]
+    model: Sao10K/L3-8B-Niitama-v1
+parameters:
+  int8_mask: true
+merge_method: passthrough
+dtype: bfloat16
+name: niitama14-20.sl
+---
+models:
+  - model: celeste14-20.sl
+    parameters:
+      weight: [1, 0.75, 0.625, 0.5, 0.375, 0.25, 0]
+  - model: niitama14-20.sl
+    parameters:
+      weight: [0, 0.25, 0.375, 0.5, 0.625, 0.75, 1]
+merge_method: dare_linear
+base_model: celeste14-20.sl
+dtype: bfloat16
+name: celeniit14-20.sl
+---
+models:
+slices:
+- sources:
+  - layer_range: [0, 4]
+    model: Nitral-AI/Hathor_Tahsin-L3-8B-v0.85
+- sources:
+  - layer_range: [1, 5]
+    model: ArliAI/ArliAI-Llama-3-8B-Formax-v1.0
+- sources:
+  - layer_range: [4, 8]
+    model: Nitral-AI/Hathor_Tahsin-L3-8B-v0.85
+- sources:
+  - layer_range: [5, 9]
+    model: ArliAI/ArliAI-Llama-3-8B-Formax-v1.0
+- sources:
+  - layer_range: [8, 10]
+    model: Sao10K/L3-8B-Niitama-v1
+- sources:
+  - layer_range: [6, 14]
+    model: nothingiisreal/L3-8B-Celeste-V1.2
+- sources:
+  - layer_range: [0, 6]
+    model: celeniit14-20.sl
+- sources:
+  - layer_range: [20, 23]
+    model: Sao10K/L3-8B-Niitama-v1
+- sources:
+  - layer_range: [22, 26]
+    model: Nitral-AI/Hathor_Tahsin-L3-8B-v0.85
+- sources:
+  - layer_range: [22, 28]
+    model: nothingiisreal/L3-8B-Celeste-V1.2
+- sources:
+  - layer_range: [25, 27]
+    model: Nitral-AI/Hathor_Tahsin-L3-8B-v0.85
+- sources:
+  - layer_range: [28, 30]
+    model: Sao10K/L3-8B-Niitama-v1
+- sources:
+  - layer_range: [25, 32]
+    model: nothingiisreal/L3-8B-Celeste-V1.2
+parameters:
+  int8_mask: true
+merge_method: passthrough
+dtype: bfloat16
+```