Processing rhetorical, morphosyntactic, and semantic features from corporate technical documents for identifying organizational domain knowledge (S).
Bell Manrique Losada, Carlos Mario Zapata Jaramillo
Published in: SEKE (2013)
Keyphrases
- semantic features
- semantic information
- domain knowledge
- document clustering
- expressed in natural language
- wordnet
- wikipedia articles
- low level features
- syntactic features
- linguistic features
- semantic similarity
- knowledge management
- xml documents
- information retrieval systems
- syntactic information
- document collections
- text classification
- information retrieval
- visual features
- natural language processing
- text documents
- metadata
- clustering algorithm
- feature vectors
- prior knowledge
- data mining
- high level
- multiscale
- semantic relations
- feature set
- low level
- text mining
- database