Login / Signup
Thuat Nguyen
Publication Activity (10 Years)
Years Active: 2020-2024
Publications (10 Years): 5
Top Topics
Language Modelling
Relevance Model
Cross Lingual
N Gram
Top Venues
CoRR
LREC/COLING
EMNLP (Demos)
EMNLP (1)
</>
Publications
</>
Thuat Nguyen
,
Chien Van Nguyen
,
Viet Dac Lai
,
Hieu Man
,
Nghia Trung Ngo
,
Franck Dernoncourt
,
Ryan A. Rossi
,
Thien Huu Nguyen
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages.
LREC/COLING
(2024)
Thuat Nguyen
,
Chien Van Nguyen
,
Viet Dac Lai
,
Hieu Man
,
Nghia Trung Ngo
,
Franck Dernoncourt
,
Ryan A. Rossi
,
Thien Huu Nguyen
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages.
CoRR
(2023)
Viet Dac Lai
,
Chien Nguyen
,
Nghia Trung Ngo
,
Thuat Nguyen
,
Franck Dernoncourt
,
Ryan A. Rossi
,
Thien Huu Nguyen
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback.
EMNLP (Demos)
(2023)
Viet Dac Lai
,
Chien Van Nguyen
,
Nghia Trung Ngo
,
Thuat Nguyen
,
Franck Dernoncourt
,
Ryan A. Rossi
,
Thien Huu Nguyen
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback.
CoRR
(2023)
Hieu Man Duc Trong
,
Duc-Trong Le
,
Amir Pouran Ben Veyseh
,
Thuat Nguyen
,
Thien Huu Nguyen
Introducing a New Dataset for Event Detection in Cybersecurity Texts.
EMNLP (1)
(2020)