SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages.
Nedjma OusidhoumShamsuddeen Hassan MuhammadMohamed AbdallaIdris AbdulmuminIbrahim Said AhmadSanchit AhujaAlham Fikri AjiVladimir AraujoAbinew Ali AyelePavan BaswaniMeriem BeloucifChris BiemannSofia BourhimChristine de KockGenet Shanko DekeboOumaima HourraneGopichand KanumoluLokesh MadasuSamuel RutundaManish ShrivastavaThamar SolorioNirmal SurangeHailegnaw Getaneh TilayeKrishnapriya VishnubhotlaGenta Indra WinataSeid Muhie YimamSaif M. MohammadPublished in: CoRR (2024)
Keyphrases
- data model
- database
- databases
- natural language
- semantic similarity
- intermediate representations
- co occurrence
- domain ontology
- relatedness measure
- semantic features
- semantic annotation
- benchmark datasets
- semantic web
- metadata
- transfer learning
- semantic network
- language independent
- textual descriptions
- semantic information
- document collections
- expressive power
- domain specific
- knowledge representation
- photo collections
- multi lingual
- data sets