Parametric Weighting of Parallel Data for Statistical Machine Translation.
Kashif ShahLoïc BarraultHolger SchwenkPublished in: IJCNLP (2011)
Keyphrases
- data sets
- data processing
- data analysis
- training data
- original data
- raw data
- historical data
- database
- high dimensional data
- complex data
- synthetic data
- data quality
- statistical analysis
- data collection
- data mining techniques
- image data
- knowledge discovery
- data structure
- real time
- information retrieval
- data points
- xml documents
- prior knowledge
- data sources
- small number
- relational databases
- semi supervised
- high quality
- missing values
- database systems
- feature selection
- noisy data
- end users
- parallel processing
- databases
- probability distribution