Login / Signup
Constructing High Quality Bilingual Corpus using Parallel Data from the Web.
Sai Man Cheok
Lap-Man Hoi
Su-Kit Tang
Rita Tse
Published in:
IoTBDS (2022)
Keyphrases
</>
high quality
database
raw data
data collection
data processing
data sources
data sets
data analysis
computer systems
image data
training data
website
web data
textual data
end users
data points
input data
information sources
web mining
web content
log files
low quality