Login / Signup

SeqPig: simple and scalable scripting for large sequencing data sets in Hadoop.

André SchumacherLuca PiredduMatti NiemenmaaAleksi KallioEija KorpelainenGianluigi ZanettiKeijo Heljanko
Published in: Bioinform. (2014)
Keyphrases
  • data sets
  • open source
  • massive scale
  • information systems
  • real world data sets
  • data intensive
  • database
  • real world
  • training data
  • data streams
  • training set
  • benchmark data sets
  • highly scalable
  • highly reliable