Use of GitHub as a platform for open collaboration on text documents.
Justin Longo, Tanya M. Kelley
Published in: OpenSym (2015)
Keyphrases
- text documents
- text mining
- text classification
- text categorization
- information extraction
- keywords
- topic models
- WordNet
- document classification
- text data
- news articles
- tf-idf
- textual information
- text collections
- bag of words
- named entities
- document clustering
- text corpus
- relevant concepts
- machine learning
- image segmentation
- training data
- clustering algorithm
- knowledge base
- search engine
- neural network
- information extraction systems
- automatic text categorization