d-matrix/gpt2

d-matrix committed on Feb 23

Commit a3da97a • 1 Parent(s): 1f48029

Update README.md

Files changed (1)

README.md CHANGED

@@ -36,14 +36,14 @@ model-index:
  type: dmx-perlexity
  value: 46.570838928222656
 ---
-This is a d-Matrix functional reference of the GPT2 model family, of the following revisions:
+This is a d-Matrix functional reference of the GPT2 model family, of the following *revisions*:
 - [`distilgpt2`](https://huggingface.co/distilbert/distilgpt2)
 - [`gpt2`](https://huggingface.co/openai-community/gpt2)
 - [`gpt2-medium`](https://huggingface.co/openai-community/gpt2-medium)
 - [`gpt2-large`](https://huggingface.co/openai-community/gpt2-large)
-- [`gpt2-xl`](https://huggingface.co/openai-community/gpt2-xl) (default)
+- [`gpt2-xl`](https://huggingface.co/openai-community/gpt2-xl)
-The reference provides the following functional configurations:
+The reference provides the following functional *configurations*:
  Configuration | Explanation
  :-- | :--
  **`BASELINE`** | a reference functionally equivalent to the original model
@@ -66,7 +66,8 @@ Prerequisites:
 >>> pipe = pipeline(
 >>> "text-generation",
 >>> model="d-matrix/gpt2",
->>> revision="gpt2-xl",
+>>> revision="gpt2-xl",
+>>> dmx_config="BASELINE",
 >>> use_auth_token=os.environ.get("HUGGING_FACE_HUB_TOKEN"),
 >>> trust_remote_code=True,
 >>> # device_map="auto", # enabling model parallel on multi-GPU nodes
@@ -75,8 +76,6 @@ Prerequisites:
 >>> pipe.model, monkey_patched=False, hf=True, input_names=["input_ids", "labels"]
 >>> )
->>> pipe.model.transform("/path/to/BASIC.yaml")
 >>> perplexity = evaluate.load("d-matrix/dmx_perplexity", module_type="metric")
 >>> input_texts = load_dataset("ptb_text_only", "penn_treebank", split="test")["sentence"]
 >>> results = perplexity.compute(model=pipe.model.body, references=input_texts)
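
Review note: after this change the README example selects both a revision and a configuration in the `pipeline(...)` call, instead of calling `pipe.model.transform(...)` afterwards. The sketch below shows one way a caller might assemble those arguments and reject a revision name that is not in the README's list. `build_pipeline_kwargs` and `REVISIONS` are hypothetical names introduced here for illustration; they are not part of the d-Matrix or `transformers` API. The default values simply mirror the README's example call and are an assumption, not a documented default.

```python
import os

# Revision names copied from the README's list of supported revisions.
REVISIONS = ("distilgpt2", "gpt2", "gpt2-medium", "gpt2-large", "gpt2-xl")


def build_pipeline_kwargs(revision="gpt2-xl", dmx_config="BASELINE"):
    """Hypothetical helper: collect the keyword arguments used in the README example.

    Only the revision is validated, because the diff shows the full revision
    list but only part of the configuration table.
    """
    if revision not in REVISIONS:
        raise ValueError(
            f"unknown revision {revision!r}; expected one of {', '.join(REVISIONS)}"
        )
    return {
        "task": "text-generation",
        "model": "d-matrix/gpt2",
        "revision": revision,
        "dmx_config": dmx_config,  # passed through unchanged, as in the README
        "use_auth_token": os.environ.get("HUGGING_FACE_HUB_TOKEN"),
        "trust_remote_code": True,
    }


kwargs = build_pipeline_kwargs(revision="gpt2", dmx_config="BASELINE")
print(kwargs["revision"], kwargs["dmx_config"])
```

The resulting dictionary could then be passed as `pipeline(**kwargs)`, assuming `transformers` is installed and the Hub is reachable. That call is left out of the sketch because it downloads and runs remote code.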