Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Datasets:
HuggingFaceFW
/
fineweb
like
1.16k
Tasks:
Text Generation
Languages:
English
Size Categories:
n>1T
ArXiv:
arxiv:
2306.01116
arxiv:
2109.07445
Tags:
Croissant
DOI:
doi:10.57967/hf/2092
License:
odc-by
Dataset card
Viewer
Files
Files and versions
Community
22
refs/convert/parquet
fineweb
/
default
3 contributors
History:
246 commits
parquet-converter
Update parquet files (step 47 of 47)
551adaf
verified
25 days ago
train-part0
Update parquet files (step 40 of 66)
25 days ago
train-part1
Update parquet files (step 40 of 47)
25 days ago
train-part2
Update parquet files (step 47 of 47)
25 days ago