A new approach to geocoding: BingGC

Real-time geocoders help users find precise locations in online mapping systems. Geocoding unstructured queries can be difficult, as users may describe map locations by referencing several spatially co-located entities (e.g., a business near a street intersection). Serving these queries is important as it provides new capabilities and allows for expanding in markets with less structured postal systems. Traditionally, this problem poses significant difficulties for online systems where latency constraints prevent exhaustive join-based algorithms. Previous work in this area involved natural language processing to segment queries based on known rules, or purely spatial approaches that are difficult to maintain and may have high latency. In this paper, we present a new approach to geocoding - BingGC - that makes fulfillment of extremely diverse geocoding queries possible via a combination of traditional web search technologies and a novel algorithm that uses textual search and spatial joins to quickly find results. It allows resolution of up to s spatially co-located entities in a single query with no pre-computation or rule-based matching. We provide experimental analysis of our system compared against leading online geocoders.

[1]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[2]  Kentaro Toyama,et al.  Robust location search from text queries , 2007, GIS.

[3]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[4]  Yi Chang,et al.  Yahoo! Learning to Rank Challenge Overview , 2010, Yahoo! Learning to Rank Challenge.

[5]  Paul A. Viola,et al.  Learning to extract information from semi-structured text using a discriminative context free grammar , 2005, SIGIR '05.

[6]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[7]  P. Zandbergen Geocoding Quality and Implications for Spatial Analysis , 2009 .

[8]  Bernhard Seeger,et al.  Geographic Information Retrieval , 2004, WebDyn@WWW.

[9]  Shashi Shekhar,et al.  Spatial databases - a tour , 2003 .

[10]  Avi Arampatzis,et al.  The design and implementation of SPIRIT: a spatially aware search engine for information retrieval on the Internet , 2007, Int. J. Geogr. Inf. Sci..

[11]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[12]  Peter Sanders,et al.  Engineering efficient error-correcting geocoding , 2011, GIS.

[13]  Torsten Suel,et al.  Efficient query processing in geographic web search engines , 2006, SIGMOD Conference.

[14]  Alberto H. F. Laender,et al.  Semantic Expansion of Geographic Web Queries Based on Natural Language Positioning Expressions , 2007, Trans. GIS.

[15]  Udayan Khurana,et al.  Crosslingual location search , 2008, SIGIR '08.

[16]  Qiang Wu,et al.  Adapting boosting for information retrieval measures , 2010, Information Retrieval.

[17]  Craig A. Knoblock,et al.  From Text to Geographic Coordinates: The Current State of Geocoding , 2007 .

[18]  Christopher B. Jones,et al.  Geographical information retrieval , 2008, Int. J. Geogr. Inf. Sci..

[19]  Xuehua Wang,et al.  A geocoding algorithm for natural language address with neighborhood properties , 2012, Proceedings of 2012 2nd International Conference on Computer Science and Network Technology.