Czech News Dataset for Semantic Textual Similarity.
Jakub SidoMichal SejákOndrej PrazákMiloslav KonopíkVáclav MoravecPublished in: CoRR (2021)
Keyphrases
- semantic similarity
- natural language
- keywords
- semantic distance
- similarity measure
- semantic similarity measure
- cross media
- semantic information
- multimedia
- benchmark datasets
- social media
- semantic web
- euclidean distance
- sentence similarity
- similarity relations
- semantic labels
- domain specific
- metadata
- word similarity
- text representation
- cross language
- automatically generated
- news articles
- semantic annotation
- co occurrence
- semantic representation
- news video
- word pairs
- language independent
- semantic concepts
- semantic network