Daedalus: Safer Document Parsing.
Iavor S. DiatchkiMike DoddsHarrison GoldsteinBill HarrisDavid A. HollandBenoît RazetCole SchlesingerSimon WinwoodPublished in: Proc. ACM Program. Lang. (2024)
Keyphrases
- information retrieval
- information retrieval systems
- document collections
- web documents
- natural language
- keywords
- document images
- natural language processing
- database
- document retrieval
- knowledge engineering
- tf idf
- context free grammars
- bayesian networks
- information extraction
- clustering algorithm
- multimedia
- relevant documents
- vector space model
- multimedia documents
- syntactic analysis