Pipelines for Procedural Information Extraction from Scientific Literature: Towards Recipes using Machine Learning and Data Science.
Huichen YangCarlos A. AguirreMaria F. De La TorreDerek ChristensenLuis BobadillaEmily DavichJordan RothLei LuoYihong TheisAlice LamThomas Yong-Jin HanDavid ButtlerWilliam H. HsuPublished in: CoRR (2019)
Keyphrases
- data science
- scientific literature
- machine learning
- text mining
- information extraction
- big data
- text processing
- unstructured text
- statistical learning
- natural language processing
- biomedical literature
- scientific papers
- text documents
- information retrieval
- data analysis
- scientific articles
- knowledge discovery
- text classification
- document clustering
- digital libraries
- data mining
- named entities
- structured data
- machine learning methods
- learning algorithm
- data sets
- cloud computing
- data management
- semi supervised learning
- natural language
- similarity measure
- topic models
- data processing
- supervised learning
- active learning
- feature selection