pszemraj
/

led-base-book-summary

text2text-generation

Inference Endpoints

Model card Files Files and versions Community

pszemraj commited on Jun 7, 2022

Commit

526eaab

•

1 Parent(s): 4948aa8

update desc

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -68,9 +68,11 @@ inference:
 - **Use cases:** long narrative summarization (think stories - as the dataset intended), article/paper/textbook/other summarization, technical:simple summarization.
  - Models trained on this dataset tend to also _explain_ what they are summarizing, which IMO is awesome.
-- This is an 'upgraded' version of [`pszemraj/led-base-16384-finetuned-booksum`](https://huggingface.co/pszemraj/led-base-16384-finetuned-booksum), it has trained for a total of six more epochs with the parameters adjusted for _very_ fine-tuning type training (super low LR, etc)
  - all the parameters for generation on the API are the same for easy comparison between versions.
-- works well on lots of text, can hand 16384 tokens/batch.
 ## Other Checkpoints on Booksum

 - **Use cases:** long narrative summarization (think stories - as the dataset intended), article/paper/textbook/other summarization, technical:simple summarization.
  - Models trained on this dataset tend to also _explain_ what they are summarizing, which IMO is awesome.
+- Trained for 16 epochs vs. [`pszemraj/led-base-16384-finetuned-booksum`](https://huggingface.co/pszemraj/led-base-16384-finetuned-booksum),
+ - parameters adjusted for _very_ fine-tuning type training (super low LR, etc)
  - all the parameters for generation on the API are the same for easy comparison between versions.
+- works well on lots of text, and can hand 16384 tokens/batch.
 ## Other Checkpoints on Booksum