Hugging Face datasets

“Datasets is a lightweight library providing two main features:
– One-line dataloaders for many public datasets: one-liners to download and pre-process any of the major public datasets (in 467 languages and dialects!) provided on the HuggingFace Datasets Hub, and
– Efficient data pre-processing: simple, fast and reproducible data pre-processing for the above public datasets as well as your own local datasets in CSV/JSON/text.”


Hugging Face: “The AI community building the future. (Build, train and deploy state of the art models powered by the reference open source in natural language processing). Research interests: solving NLP, one commit at a time”. About them and how to join this org:

