A Class Library for the Integration of NLP Tools: Definition and implementation of an Abstract Data Type Collection for the manipulation of SGML documents in a context of stand-off linguistic annotation.
Xabier ArtolaArantza Díaz de Ilarraza SánchezNerea EzeizaKoldo GojenolaGregorio HernándezAitor SoroaPublished in: LREC (2002)
Keyphrases
- electronic documents
- document collections
- natural language processing
- pdf files
- linguistic analysis
- abstract data types
- natural language
- metadata
- structured documents
- data abstraction
- logical structure
- digital collections
- free text
- information extraction
- database
- hand crafted
- xml documents
- digital libraries
- web documents
- controlled vocabulary
- denotational semantics
- spatial data
- relevant documents
- data types
- semantic information
- question answering
- wordnet
- data management
- programming language
- data mining