Publication: Information Extraction in Domain and Generic Documents: Findings from Heuristic-based and Data-driven Approaches.