BERT for Long Documents: A Case Study of Automated ICD Coding.
Arash AfkanpourShabir AdeelHansenclever BassaniArkady EpshteynHongbo FanIsaac JonesMahan MalihiAdrian NauthRaj SinhaSanjana WoonnaShiva ZamaniElli KanalMikhail FomitchevDonny CheungPublished in: CoRR (2022)
Keyphrases
- free text
- document collections
- information retrieval
- coding scheme
- information retrieval systems
- relevant documents
- semi automated
- metadata extraction
- medical records
- patient records
- fully automated
- text documents
- case study
- document retrieval
- document classification
- test bed
- legal documents
- xml documents
- keywords
- metadata
- vector space
- latent semantic analysis
- digital documents
- retrieval systems
- user queries
- test collection
- structured data
- web documents
- retrieved documents
- co occurrence
- document content