The ALVIS Format for Linguistically Annotated Documents.
Adeline NazarenkoÉrick AlphonseJulien DerivièreThierry HamonGuillaume VauvertDavy WeissenbacherPublished in: LREC (2006)
Keyphrases
- metadata
- xml format
- electronic documents
- human readable
- information retrieval systems
- document collections
- web documents
- manually constructed
- pdf files
- information retrieval
- keywords
- xml documents
- document retrieval
- file formats
- pdf documents
- document classification
- relevant documents
- similarity measure
- text documents
- vector space model
- retrieval systems
- free text
- plain text
- extensible markup language
- document clustering
- multimedia documents
- vector space
- linguistic knowledge
- digital documents
- manually annotated
- document analysis
- legal documents
- multi document summarization
- multimedia