Word Boundary Identification for Myanmar Text Using Conditional Random Fields.
Win Pa PaYe Kyaw ThuAndrew M. FinchEiichiro SumitaPublished in: ICGEC (2) (2015)
Keyphrases
- manually constructed
- text corpus
- keywords
- english text
- natural language text
- lexical features
- text input
- string matching
- linguistic information
- sentence level
- english words
- word level
- noun phrases
- text segments
- word counts
- syntactic information
- word pairs
- text mining
- syntactic analysis
- chinese text
- punctuation marks
- co occurrence
- multiword
- text retrieval
- related words
- sentence similarity
- printed text
- syntactic categories
- text documents
- word sense disambiguation
- stop words
- named entity recognition
- word co occurrence
- page layout
- printed documents
- word sense
- training corpus
- information retrieval
- automatically generated
- unknown words
- named entities
- n gram
- handwritten documents
- named entity recognizer