A Proposal for the Integration of NLP Tools using SGML-Tagged Documents.
Xabier ArtolaArantza Díaz de Ilarraza SánchezNerea EzeizaKoldo GojenolaA. MaritxalarAitor SoroaPublished in: LREC (2000)
Keyphrases
- electronic documents
- structured documents
- text analysis
- pdf files
- free text
- information retrieval
- digital libraries
- document retrieval
- document analysis
- natural language processing
- document archives
- document collections
- text mining
- information extraction
- document classification
- document structure
- relevant documents
- text documents
- web documents
- information retrieval systems
- management tools
- natural language
- machine learning
- logical structure
- metadata
- data integration
- xml documents
- question answering
- language processing
- digital objects
- keywords
- end users
- document clustering
- web pages
- linguistic analysis
- document management
- extensible markup language
- artificial intelligence
- database