Datasets: A Community Library for Natural Language Processing.
Quentin Lhoest, Albert Villanova del Moral, Yacine Jernite, Abhishek Thakur, Patrick von Platen, Suraj Patil, Julien Chaumond, Mariama Drame, Julien Plu, Lewis Tunstall, Joe Davison, Mario Sasko, Gunjan Chhablani, Bhavitvya Malik, Simon Brandeis, Teven Le Scao, Victor Sanh, Canwen Xu, Nicolas Patry, Angelina McMillan-Major, Philipp Schmid, Sylvain Gugger, Clément Delangue, Théo Matussière, Lysandre Debut, Stas Bekman, Pierric Cistac, Thibault Goehringer, Victor Mustar, François Lagunas, Alexander M. Rush, Thomas Wolf
Published in: EMNLP (Demos) (2021)
Keyphrases
- natural language processing
- text mining
- machine learning
- information extraction
- benchmark datasets
- semantic analysis
- uci machine learning repository
- natural language
- computational linguistics
- search engine
- named entity recognition
- knowledge resources
- database
- language processing
- word sense disambiguation
- social networks
- neural network