Significance of Bridging Real-world Documents and NLP Technologies.
Tadayoshi Hara, Goran Topic, Yusuke Miyao, Akiko Aizawa
Published in: OIAF4HLT@COLING (2014)
Keyphrases
- real world
- text analysis
- data mining
- free text
- document collections
- information retrieval
- wide range
- information retrieval systems
- text documents
- text mining
- natural language processing
- natural language
- document analysis
- vector space model
- document clustering
- xml documents
- text processing
- document classification
- case study
- document retrieval
- web documents
- machine translation
- ranked list
- data sets
- synthetic data
- metadata
- language processing
- keywords
- linguistic analysis
- relevant documents
- structured documents
- web environment
- information society
- extensible markup language
- field of natural language processing