Topic Segmentation of Semi-Structured and Unstructured Conversational Datasets using Language Models.
Reshmi Ghosh, Harjeet Singh Kajal, Sharanya Kamath, Dhuri Shrivastava, Samyadeep Basu, Hansi Zeng, Soundararajan Srinivasan
Published in: CoRR (2023)
Keyphrases
- semi-structured
- language model
- topic segmentation
- language modeling
- structured data
- n-gram
- document retrieval
- information extraction
- probabilistic model
- query expansion
- information retrieval
- web documents
- data model
- test collection
- retrieval model
- text mining
- relevance model
- vector space model
- database
- query terms
- data sets
- document representation
- passage retrieval