New Word Extraction From Chinese Financial Documents.
Liwei YanBo BaiWei ChenDapeng Oliver WuPublished in: IEEE Signal Process. Lett. (2017)
Keyphrases
- word segmentation
- chinese text
- word spotting
- word frequencies
- keywords
- chinese word segmentation
- document collections
- keyword extraction
- listed companies
- text corpus
- bilingual lexicon
- natural language text
- information extraction
- information retrieval
- unknown words
- sentence level
- information retrieval systems
- chinese text retrieval
- text documents
- related words
- document analysis
- multiword
- printed documents
- latent topics
- relevant documents
- handwritten documents
- linguistic information
- word pairs
- word recognition
- xml documents
- web documents
- concept space
- word frequency
- related documents
- co occurrence
- retrieval systems
- stop words
- text retrieval
- document retrieval
- stock market
- document clustering
- cross language
- handwriting recognition
- natural language processing
- text corpora
- n gram
- word sense
- natural language
- page layout
- text classification