BuDDIE: A Business Document Dataset for Multi-task Information Extraction.
Ran ZmigrodDongsheng WangMathieu SibueYulong PeiPetr BabkinIvan BrugereXiaomo LiuNacho NavarroAntony PapadimitriouWilliam WatsonZhiqiang MaArmineh NourbakhshSameena ShahPublished in: CoRR (2024)
Keyphrases
- multi task
- information extraction
- multi task learning
- text documents
- learning tasks
- multitask learning
- information retrieval
- multiple tasks
- multi class
- feature selection
- natural language processing
- sparse learning
- metric learning
- transfer learning
- gaussian processes
- text mining
- learning problems
- text summarization
- data mining
- machine learning
- collective classification
- learning algorithm
- kernel methods
- conditional random fields
- semi supervised learning
- text classification
- reinforcement learning