XML content warehousing: Improving sociological studies of mailing lists and web data
Benjamin NguyenAntoine VionFrançois-Xavier DudouetDario ColazzoIoana ManolescuPierre SenellartPublished in: CoRR (2011)
Keyphrases
- web data
- mailing lists
- web content
- semi structured
- web mining
- web information
- data repositories
- open source software
- web documents
- semistructured data
- page contents
- metadata
- web pages
- web usage mining
- anti spam filtering
- xml documents
- structured data
- data warehouse
- data model
- source code
- link structure
- databases
- instant messaging
- website
- database systems
- relational databases
- query logs
- information extraction
- keywords
- multimedia
- search engine
- open source