Building datasets to support information extraction and structure parsing from electronic theses and dissertations.
William A. IngramJian WuSampanna Yashwant KahuJavaid Akbar ManzoorBipasha BanerjeeAman AhujaMuntabir Hasan ChoudhuryLamia SalsabilWinston ShieldsEdward A. FoxPublished in: Int. J. Digit. Libr. (2024)
Keyphrases
- information extraction
- natural language processing
- natural language
- natural language parsing
- text mining
- web mining
- named entity recognition
- benchmark datasets
- bulletin board
- dependency parsing
- data repositories
- synthetic datasets
- precision and recall
- semi structured
- data sets
- end users
- information retrieval
- machine learning
- data mining