Do We Really Need That Many Parameters In Transformer For Extractive Summarization? Discourse Can Help !
Wen XiaoPatrick HuberGiuseppe CareniniPublished in: CoRR (2020)
Keyphrases
- extractive summarization
- maximum likelihood
- parameter values
- data sets
- sensitivity analysis
- input parameters
- pairwise
- parameter settings
- parameter estimation
- speech acts
- computer mediated
- maximum entropy
- expectation maximization
- fuzzy logic
- artificial neural networks
- natural language
- learning environment
- learning algorithm