Text classification of column headers with a controlled vocabulary: leveraging LLMs for metadata enrichment.
Margherita MartoranaTobias KuhnLise StorkJacco van OssenbruggenPublished in: CoRR (2024)
Keyphrases
- controlled vocabulary
- text classification
- metadata
- gene ontology
- text mining
- national library of medicine
- free text
- microarray
- journal articles
- machine learning
- feature selection
- databases
- knn
- information theoretic
- semantic similarity
- multimedia content
- artificial intelligence
- semantic technologies
- information retrieval