Login / Signup
Combining Content-Based and URL-Based Heuristics to Harvest Aligned Bitexts from Multilingual Sites with Bitextor.
Miquel Esplà-Gomis
Mikel L. Forcada
Published in:
Prague Bull. Math. Linguistics (2010)
Keyphrases
</>
website
image retrieval
web pages
digital libraries
databases
information filtering
heuristic search
text categorization
neural network
data sets
retrieval method
language resources
visual information retrieval
multimedia
lower bound
genetic algorithm
natural language processing
multi lingual