On Using Metadata and Compression Algorithms to Cluster Heterogeneous Documents from a Semantic Point of View.
Alexandra CernianDorin CarstoiuValentin SgarciuPublished in: ICSEA (2010)
Keyphrases
- compression algorithm
- metadata
- semantically rich
- semantic metadata
- semantic information
- semantic content
- image compression
- semantic search
- data compression
- compression ratio
- digital documents
- semantic web technologies
- compression scheme
- heterogeneous data
- semantic relationships
- metadata creation
- controlled vocabulary
- bitstream
- digital libraries
- multimedia documents
- document space
- electronic documents
- metadata extraction
- quadtree decomposition
- learning objects
- heterogeneous sources
- lossless data compression
- wavelet compression
- xml documents
- wavelet based image
- xml format
- databases
- document repository
- document collections
- information retrieval systems
- clustering algorithm
- document repositories
- information retrieval
- semantic data
- document set
- search interface
- document clustering
- cosine transform
- natural language
- multimedia
- feature selection
- file size
- block coding
- domain ontology
- binary images
- web documents
- semantic web
- multiscale
- user generated tags