The ALVIS Format for Linguistically Annotated Documents
Adeline Nazarenko, Érick Alphonse, Julien Derivière, Thierry Hamon, Guillaume Vauvert, Davy Weissenbacher
Published in: CoRR (2006)
Keyphrases
- metadata
- file formats
- information retrieval
- human readable
- electronic documents
- xml format
- manually constructed
- xml documents
- pdf documents
- document collections
- document clustering
- document classification
- relevant documents
- web documents
- document content
- plain text
- databases
- information retrieval systems
- database
- manually annotated
- multimedia
- keywords
- free text
- document retrieval
- vector space model
- structured documents
- pdf files
- text documents
- text retrieval
- digital libraries
- structured data
- extensible markup language
- legal documents
- digital documents
- wordnet
- expert finding
- document analysis
- retrieved documents
- ranked list