Detecting Noisy Swiss German Web Text Using RNN- and Rule-Based Techniques.
Janis GoldzycherJonathan SchaberPublished in: SwissText/KONVENS (2020)
Keyphrases
- web documents
- text information
- textual data
- information retrieval and extraction
- web applications
- website
- recurrent neural networks
- database
- web pages
- text content
- textual features
- nearest neighbor
- semantic web
- web content
- expert systems
- web images
- information retrieval
- free text
- digital documents
- web data
- text retrieval
- web mining
- information sources
- text mining
- content features
- key concepts
- web technologies
- textual information
- rule base
- linked data
- noisy environments
- high dimensional