Proceedings of the Second Louhi Workshop on Text and Data Mining of Health Documents, Louhi@NAACL-HLT 2010, Los Angeles, CA, USA, June 5, 2010
Published in: Louhi@NAACL-HLT (2010)
Keyphrases
- naacl hlt
- los angeles
- natural language processing
- free text
- text mining
- text documents
- data mining
- computer personnel research group
- textual data
- information extraction
- text data
- information retrieval
- web documents
- digital documents
- machine learning
- cellular automata
- wordnet
- plagiarism detection
- latent semantic analysis
- keywords
- textual content
- document analysis
- document clustering
- student research workshop
- document content
- natural language text
- electronic documents
- computational linguistics
- knowledge discovery
- text retrieval
- natural language
- text collections
- semi-supervised learning
- multimedia documents
- association rules
- printed documents
- document collections
- relevant documents
- metadata
- text classification
- semantic information
- text categorization
- related documents
- text summarization
- data analysis
- probabilistic model
- health care
- text lines
- xml documents
- web pages
- data analysis and data mining