TheBloke committed
Commit d99f20a
1 Parent(s): e61fe8f

Upload README.md

Files changed (1): README.md (+190 lines)

---
base_model: nRuaif/Rose-Kimiko-20B
inference: false
library_name: peft
license: llama2
model-index:
- name: Rose-Kimiko
  results: []
model_creator: nRuaif
model_name: Rose Kimiko 20B
model_type: llama
prompt_template: 'Below is an instruction that describes a task. Write a response
  that appropriately completes the request.


  ### Instruction:

  {prompt}


  ### Response:

  '
quantized_by: TheBloke
tags:
- generated_from_trainer
---

<!-- header start -->
<!-- 200823 -->
<div style="width: auto; margin-left: auto; margin-right: auto">
<img src="https://i.imgur.com/EBdldam.jpg" alt="TheBlokeAI" style="width: 100%; min-width: 400px; display: block; margin: auto;">
</div>
<div style="display: flex; justify-content: space-between; width: 100%;">
    <div style="display: flex; flex-direction: column; align-items: flex-start;">
        <p style="margin-top: 0.5em; margin-bottom: 0em;"><a href="https://discord.gg/theblokeai">Chat & support: TheBloke's Discord server</a></p>
    </div>
    <div style="display: flex; flex-direction: column; align-items: flex-end;">
        <p style="margin-top: 0.5em; margin-bottom: 0em;"><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? TheBloke's Patreon page</a></p>
    </div>
</div>
<div style="text-align:center; margin-top: 0em; margin-bottom: 0em"><p style="margin-top: 0.25em; margin-bottom: 0em;">TheBloke's LLM work is generously supported by a grant from <a href="https://a16z.com">andreessen horowitz (a16z)</a></p></div>
<hr style="margin-top: 1.0em; margin-bottom: 1.0em;">
<!-- header end -->

# Rose Kimiko 20B - FP16
- Model creator: [nRuaif](https://huggingface.co/nRuaif)
- Original model: [Rose Kimiko 20B](https://huggingface.co/nRuaif/Rose-Kimiko-20B)

<!-- description start -->
## Description

This repo contains pytorch format fp16 model files for [nRuaif's Rose Kimiko 20B](https://huggingface.co/nRuaif/Rose-Kimiko-20B).

It is the result of merging nRuaif's LoRA adapter on to the base model, [tavtav/Rose-20B](https://huggingface.co/tavtav/Rose-20B), and converting the merged model to float16.

These files were prepared using hardware kindly provided by [Massed Compute](https://massedcompute.com/).
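
For reference, a merge like this can be reproduced with PEFT. The sketch below is illustrative only, not the exact script used for this repo: the model IDs come from this card, the output directory name is just an example, and you need enough memory to hold the 20B model in fp16.

```python
# Minimal sketch: merge the LoRA adapter on to the base model and save fp16 weights.
# Model IDs are taken from this card; the output path is only an example.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "tavtav/Rose-20B"
adapter_id = "nRuaif/Rose-Kimiko-20B"

base = AutoModelForCausalLM.from_pretrained(
    base_id, torch_dtype=torch.float16, low_cpu_mem_usage=True
)
merged = PeftModel.from_pretrained(base, adapter_id).merge_and_unload()

tokenizer = AutoTokenizer.from_pretrained(base_id)
merged.save_pretrained("Rose-Kimiko-20B-fp16")
tokenizer.save_pretrained("Rose-Kimiko-20B-fp16")
```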

<!-- description end -->
<!-- repositories-available start -->
## Repositories available

* [AWQ model(s) for GPU inference.](https://huggingface.co/TheBloke/Rose-Kimiko-20B-AWQ)
* [GPTQ models for GPU inference, with multiple quantisation parameter options.](https://huggingface.co/TheBloke/Rose-Kimiko-20B-GPTQ)
* [2, 3, 4, 5, 6 and 8-bit GGUF models for CPU+GPU inference](https://huggingface.co/TheBloke/Rose-Kimiko-20B-GGUF)
* [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/TheBloke/Rose-Kimiko-20B-fp16)
* [nRuaif's original LoRA adapter, which can be merged on to the base model.](https://huggingface.co/nRuaif/Rose-Kimiko-20B)

<!-- repositories-available end -->

<!-- prompt-template start -->
## Prompt template: Alpaca

```
Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
{prompt}

### Response:

```

<!-- prompt-template end -->
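
As a rough usage sketch (not an official example shipped with this repo), the fp16 files can be loaded with `transformers` and prompted using the Alpaca template above. The repo ID matches the fp16 link in the repositories list; the example instruction and sampling settings are illustrative only, and `device_map="auto"` requires `accelerate`.

```python
# Sketch: load the fp16 weights and generate with the Alpaca prompt template.
# Repo ID comes from the "Repositories available" list; sampling settings are examples.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TheBloke/Rose-Kimiko-20B-fp16"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

instruction = "Write a short scene in which two rivals are forced to cooperate."
prompt = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    f"### Instruction:\n{instruction}\n\n### Response:\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```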


<!-- footer start -->
<!-- 200823 -->
## Discord

For further support, and discussions on these models and AI in general, join us at:

[TheBloke AI's Discord server](https://discord.gg/theblokeai)

## Thanks, and how to contribute

Thanks to the [chirper.ai](https://chirper.ai) team!

Thanks to Clay from [gpus.llm-utils.org](https://gpus.llm-utils.org)!

I've had a lot of people ask if they can contribute. I enjoy providing models and helping people, and would love to be able to spend even more time doing it, as well as expanding into new projects like fine tuning/training.

If you're able and willing to contribute, it will be most gratefully received and will help me to keep providing more models, and to start work on new AI projects.

Donators will get priority support on any and all AI/LLM/model questions and requests, access to a private Discord room, plus other benefits.

* Patreon: https://patreon.com/TheBlokeAI
* Ko-Fi: https://ko-fi.com/TheBlokeAI

**Special thanks to**: Aemon Algiz.

**Patreon special mentions**: Michael Levine, 阿明, Trailburnt, Nikolai Manek, John Detwiler, Randy H, Will Dee, Sebastain Graf, NimbleBox.ai, Eugene Pentland, Emad Mostaque, Ai Maven, Jim Angel, Jeff Scroggin, Michael Davis, Manuel Alberto Morcote, Stephen Murray, Robert, Justin Joy, Luke @flexchar, Brandon Frisco, Elijah Stavena, S_X, Dan Guido, Undi ., Komninos Chatzipapas, Shadi, theTransient, Lone Striker, Raven Klaugh, jjj, Cap'n Zoog, Michel-Marie MAUDET (LINAGORA), Matthew Berman, David, Fen Risland, Omer Bin Jawed, Luke Pendergrass, Kalila, OG, Erik Bjäreholt, Rooh Singh, Joseph William Delisle, Dan Lewis, TL, John Villwock, AzureBlack, Brad, Pedro Madruga, Caitlyn Gatomon, K, jinyuan sun, Mano Prime, Alex, Jeffrey Morgan, Alicia Loh, Illia Dulskyi, Chadd, transmissions 11, fincy, Rainer Wilmers, ReadyPlayerEmma, knownsqashed, Mandus, biorpg, Deo Leter, Brandon Phillips, SuperWojo, Sean Connelly, Iucharbius, Jack West, Harry Royden McLaughlin, Nicholas, terasurfer, Vitor Caleffi, Duane Dunston, Johann-Peter Hartmann, David Ziegler, Olakabola, Ken Nordquist, Trenton Dambrowitz, Tom X Nguyen, Vadim, Ajan Kanaga, Leonard Tan, Clay Pascal, Alexandros Triantafyllidis, JM33133, Xule, vamX, ya boyyy, subjectnull, Talal Aujan, Alps Aficionado, wassieverse, Ari Malik, James Bentley, Woland, Spencer Kim, Michael Dempsey, Fred von Graf, Elle, zynix, William Richards, Stanislav Ovsiannikov, Edmond Seymore, Jonathan Leane, Martin Kemka, usrbinkat, Enrico Ros

Thank you to all my generous patrons and donators!

And thank you again to a16z for their generous grant.

<!-- footer end -->

# Original model card: nRuaif's Rose Kimiko 20B


<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

[<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
# qlora-out

This model is a fine-tuned version of [tavtav/Rose-20B](https://huggingface.co/tavtav/Rose-20B) on the Kimiko dataset.

## Model description

The prompt format used is the ShareGPT/Vicuna format.

## Intended uses & limitations

As requested by many people, this LoRA is intended to fix spelling errors in Rose 20B's output.

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training (a rough `TrainingArguments` equivalent is sketched after this list):
- learning_rate: 0.0002
- train_batch_size: 2
- eval_batch_size: 2
- seed: 42
- gradient_accumulation_steps: 4
- total_train_batch_size: 8
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: cosine
- lr_scheduler_warmup_steps: 10
- num_epochs: 2

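These settings map roughly onto a Hugging Face `TrainingArguments` object as sketched below. This is an illustrative reconstruction only; the actual run was configured through Axolotl, and the `output_dir` simply mirrors the heading of this card.

```python
# Rough TrainingArguments equivalent of the hyperparameters listed above.
# Illustrative only; the original run was driven by an Axolotl config, not this script.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="qlora-out",
    learning_rate=2e-4,
    per_device_train_batch_size=2,
    per_device_eval_batch_size=2,
    gradient_accumulation_steps=4,   # 2 per device x 4 accumulation steps -> total batch size 8
    num_train_epochs=2,
    lr_scheduler_type="cosine",
    warmup_steps=10,
    seed=42,
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
)
```
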
### Training results



### Framework versions

- Transformers 4.36.0.dev0
- Pytorch 2.0.1+cu118
- Datasets 2.15.0
- Tokenizers 0.15.0

## Training procedure

The following `bitsandbytes` quantization config was used during training (an equivalent `BitsAndBytesConfig` is sketched after this list):
- quant_method: bitsandbytes
- load_in_8bit: False
- load_in_4bit: True
- llm_int8_threshold: 6.0
- llm_int8_skip_modules: None
- llm_int8_enable_fp32_cpu_offload: False
- llm_int8_has_fp16_weight: False
- bnb_4bit_quant_type: nf4
- bnb_4bit_use_double_quant: True
- bnb_4bit_compute_dtype: bfloat16

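In current `transformers` terms, that configuration corresponds roughly to the `BitsAndBytesConfig` below. This is a readability aid reconstructed from the list above, not the exact object used during training.

```python
# Approximate BitsAndBytesConfig matching the 4-bit (QLoRA) settings listed above.
import torch
from transformers import BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    load_in_8bit=False,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
    llm_int8_threshold=6.0,
    llm_int8_skip_modules=None,
    llm_int8_enable_fp32_cpu_offload=False,
    llm_int8_has_fp16_weight=False,
)
```
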
### Framework versions


- PEFT 0.6.0