The Text Encoding Initiative: Flexible and Extensible Document Encoding.
David T. BarnardNancy IdePublished in: J. Am. Soc. Inf. Sci. (1997)
Keyphrases
- information retrieval
- keywords
- database
- web documents
- text documents
- encoding scheme
- text clustering
- document collections
- document images
- scientific documents
- text mining
- semantic information
- bag of words
- information extraction
- document representation
- textual data
- markup language
- document analysis
- data model
- textual content
- keyword extraction
- automatic text summarization