Chat Disentanglement: Data for New Domains and Methods for More Accurate Annotation.
Sai R. GouravajhalaAndrew M. VernierYiming ShiZihan LiMark S. AckermanJonathan K. KummerfeldPublished in: ALTA (2023)
Keyphrases
- data sets
- data mining methods
- high quality
- data collection
- high dimensional data
- synthetic data
- data analysis
- raw data
- application domains
- image data
- data mining techniques
- data reduction
- data mining applications
- spectral clustering
- original data
- statistical methods
- statistical tests
- complex structures
- data processing
- knowledge discovery
- preprocessing
- data structure
- statistical analysis
- input data
- missing data
- missing values
- data points
- significant improvement
- prior knowledge
- decision trees
- metadata
- machine learning
- database