Large, Multilingual, Broadcast News Corpora for Cooperative Research in Topic Detection and Tracking: The TDT-2 and TDT-3 Corpus Efforts.
Christopher Cieri, David Graff, Mark Liberman, Nii Martey, Stephanie M. Strassel
Published in: LREC (2000)
Keyphrases
- topic detection and tracking
- broadcast news
- story link detection
- topic tracking
- news stories
- topic detection
- automatic speech recognition
- video search
- news video
- vector space model
- document representation
- text data
- natural language processing
- digital libraries
- language independent
- machine learning
- language processing
- meta search
- dependency structure
- news articles
- document clustering
- text mining