TransCasm: A Bilingual Corpus of Sarcastic Tweets.
Desline Simon, Sheila Castilho, Pintu Lohar, Haithem Afli
Published in: PoliticalNLP@LREC (2022)
Keyphrases
- parallel corpus
- multiword
- parallel corpora
- chinese english
- sentence pairs
- statistical machine translation
- social media
- comparable corpora
- machine translation
- news articles
- parallel texts
- cross language information retrieval
- manually annotated
- topic tracking
- machine translation system
- word pairs
- bilingual lexicon
- cross lingual
- named entities
- test set
- query translation
- probabilistic model
- wikipedia articles
- cross language
- word alignment
- context sensitive
- wordnet
- short texts
- query expansion
- text mining