A natural language processing and geospatial clustering framework for harvesting local place names from geotagged housing advertisements.
Yingjie HuHuina MaoGrant McKenziePublished in: CoRR (2018)
Keyphrases
- clustering framework
- natural language processing
- named entities
- clustering method
- named entity recognition
- clustering algorithm
- semi supervised
- information extraction
- k means
- text mining
- similarity metric
- text clustering
- natural language
- high dimensional datasets
- machine learning
- question answering
- wordnet
- knowledge representation
- semantic relations
- spatial data
- text summarization
- textual data
- high dimensional data
- high dimensional
- similarity measure
- web pages
- semi supervised learning
- principal component analysis
- knn
- background knowledge
- spectral clustering
- feature extraction
- information retrieval