Login / Signup
Large-scale text processing pipeline with Apache Spark.
Alexey Svyatkovskiy
Kosuke Imai
Mary Kroeger
Yuki Shiraito
Published in:
CoRR (2019)
Keyphrases
</>
processing pipeline
open source
real world
information retrieval
text documents
text retrieval
real life
text analysis
web server
database
small scale
web documents
text mining
open source software
news stories
textual information
textual data
data mining
natural language text
mailing lists