Introducing the CURLICAT Corpora: Seven-language Domain Specific Annotated Corpora from Curated Sources.
Tamás Váradi, Bence Nyéki, Svetla Koeva, Marko Tadic, Vanja Stefanec, Maciej Ogrodniczuk, Bartlomiej Niton, Piotr Pezik, Verginica Barbu Mititelu, Elena Irimia, Maria Mitrofan, Dan Tufis, Radovan Garabík, Simon Krek, Andraz Repar
Published in: LREC (2022)
Keyphrases
- domain specific
- general purpose
- programming language
- domain independent
- knowledge sources
- specific domains
- parallel corpus
- natural language processing
- natural language
- specification language
- language learning
- linguistic resources
- domain experts
- structured data
- scientific databases
- biomedical literature
- language processing
- database
- information sources
- relational databases
- keywords
- information retrieval
- databases