Hugging Face
Josh Bauer
josh-sematic
4 followers · 3 following
https://github.com/augray/
augray
AI & ML interests
None yet
Organizations
josh-sematic's activity
New activity in
airtrain-ai/fineweb-edu-fortified
23 days ago
Can you add more labels to "The composition of fineweb-edu-fortified, produced by automatically clustering a 500k row sample in Airtrain " picture?
1
#105 opened 23 days ago by
Josephgflowers
New activity in
airtrain-ai/fineweb-edu-fortified
about 1 month ago
No "\n\n" in the dataset?!
1
#104 opened about 2 months ago by
ymh233
New activity in
airtrain-ai/fineweb-edu-fortified
2 months ago
Deduped version of fineweb on HuggingFace yields "This dataset has 218 files that have been marked as unsafe."
1
#103 opened 2 months ago by
egor-pakhomov
New activity in
airtrain-ai/fineweb-edu-fortified
3 months ago
CC-MAIN-2024-10
#102 opened 3 months ago by
josh-sematic