A Scalable and Distributed NLP Architecture for Web Document Annotation.
Julien Derivière, Thierry Hamon, Adeline Nazarenko
Published in: FinTAL (2006)
Keyphrases
- web documents
- scalable distributed
- information extraction
- distributed architecture
- natural language processing
- distributed processing
- hierarchical architecture
- lightweight
- web pages
- semi structured
- prefetching
- loosely coupled
- distributed systems
- agent based architecture
- peer to peer
- question answering
- fully distributed
- multi agent architecture
- layered architecture
- highly distributed
- master slave
- hand crafted
- natural language
- data intensive
- web logs
- high scalability
- distributed environment
- keywords
- management system
- website
- heterogeneous environments
- active learning
- wordnet
- textual information
- vector space model
- machine learning
- unstructured documents
- map reduce
- multi agent
- web data
- visual features