Experiments on the Construction of a Phonetically Balanced Corpus from the Web.
Luis Villaseñor Pineda, Manuel Montes-y-Gómez, Dominique Vaufreydaz, Jean-François Serignat
Published in: CICLing (2004)
Keyphrases
- website
- web users
- web applications
- semantic web
- web documents
- newspaper articles
- web mining
- construction process
- web technologies
- linked data
- web content
- information sources
- web pages
- data sets
- web data
- web resources
- web intelligence
- link analysis
- manually annotated
- search engine
- topic models
- user experience
- co-occurrence
- text mining
- end users
- metadata