Evaluating a Topic Modelling Approach to Measuring Corpus Similarity.
Richard FothergillPaul CookTimothy BaldwinPublished in: LREC (2016)
Keyphrases
- similarity measure
- topic segmentation
- text corpora
- topic tracking
- topic detection and tracking
- similarity function
- word pairs
- topic models
- distance measure
- scientific papers
- word frequency
- concept space
- document level
- manually annotated
- machine learning
- user defined
- euclidean distance
- test set
- similarity metric
- semantic similarity
- distance function
- document corpus
- conversational speech