A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure.
Keiji ShinzatoDaisuke KawaharaChikara HashimotoSadao KurohashiPublished in: LREC (2008)
Keyphrases
- data collection
- natural language processing
- web scale
- web applications
- machine learning
- website
- chinese web
- textual data
- sensor networks
- wireless sensor networks
- data analysis
- real world
- information extraction
- text mining
- highly distributed
- web mining
- named entity recognition
- small scale
- semantic web
- web resources
- linked data
- text processing
- web documents
- computational linguistics
- computational biology
- semantic analysis
- web users
- word sense disambiguation
- information sources
- end users
- expert systems
- natural language
- sentiment analysis
- question answering
- database
- knowledge base
- semantic technologies
- data entry
- artificial intelligence
- databases