🔘 Laboratory page: github.com/huggingface/datasets
Summary
“Datasets is a lightweight library providing two main features:
– One-line dataloaders for many public datasets: one liners to download and pre-process any of the number of datasets major public datasets (in 467 languages and dialects!) provided on the HuggingFace Datasets Hub and,
– An efficient data pre-processing: simple, fast and reproducible data pre-processing for the above public datasets as well as your own local datasets in CSV/JSON/text.”
Author
Hugging Face. “The AI community building the future. (Build, train and deploy state of the art models powered by the reference open source in natural language processing).Research interests: solving NLP, one commit at a time”. About they and join this org: https://huggingface.co/huggingface
#R0identifier=”9aa989eb74fe4c887ef303fbc654f681″ |
Liked this post? Follow this blog to get more.