Cocytus: parallel NLP over disparate data.
Noah EvansMasayuki AsaharaYuji MatsumotoPublished in: Trait. Autom. des Langues (2008)
Keyphrases
- data sets
- data sources
- data collection
- synthetic data
- data points
- original data
- sensor data
- data processing
- natural language processing
- statistical analysis
- high quality
- training data
- feature selection
- complex data
- database
- data distribution
- missing data
- data objects
- historical data
- image data
- data mining
- data mining techniques
- information extraction
- high dimensional data
- prior knowledge
- xml documents
- experimental data
- relational databases
- natural language
- artificial intelligence
- learning algorithm