# Geocoding

 



Many options are available for geocoding batches of address data. The CGA has the most expertise with the Esri and Google geocoding services; instructions and guidance on using both are given below.

**Esri:** Harvard Key holders have access to Esri's ArcGIS World Geocoding Service. See the "Geocoding\_Lab" document within [this tutorial](https://drive.google.com/file/d/1FAZqBvBYmM2f6LFcgttPCqnrFX76paCF/view?usp=sharing) ([zip archive](https://stack.nerc.mghpcc.org:13808/swift/v1/AUTH_a61fb932012542e3ba28f546b14433c1/cga-web-documents/training/Geocoding_lab.zip)), which includes sample data, to use the ArcGIS World Geocoding Service either within ArcGIS Pro (step 2 of the tutorial) or with ArcGIS Online (step 4). There is a maximum of 60,000 addresses per batch. If you are geocoding more than 60,000 addresses in the U.S., using a local copy of the geocoder is recommended; see the Local Geocoding Options instructions below.
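When an address table exceeds the 60,000-address batch limit mentioned above, one approach is to split it into smaller files before uploading. The following is a minimal sketch, assuming the input is a UTF-8 CSV file with a header row; the function names and the `batch_001.csv` naming pattern are illustrative, not part of any Esri tool.

```python
import csv
from itertools import islice
from pathlib import Path

MAX_BATCH = 60_000  # per-batch limit stated in the text above


def chunk_rows(rows, size=MAX_BATCH):
    """Yield successive lists of at most `size` rows from any iterable."""
    iterator = iter(rows)
    while True:
        batch = list(islice(iterator, size))
        if not batch:
            return
        yield batch


def split_csv(src, out_dir, size=MAX_BATCH):
    """Write `src` as numbered CSV files of at most `size` data rows each.

    The header row is repeated at the top of every output file.
    Returns the list of paths written.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with open(src, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader)  # assumes the file has a header row
        for i, batch in enumerate(chunk_rows(reader, size), start=1):
            path = out_dir / f"batch_{i:03d}.csv"
            with open(path, "w", newline="", encoding="utf-8") as out:
                writer = csv.writer(out)
                writer.writerow(header)
                writer.writerows(batch)
            written.append(path)
    return written
```

For example, `split_csv("addresses.csv", "batches")` would write one file per batch into a `batches` folder, each of which can then be geocoded separately.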

**Google Maps Platform:** The [Geocoding API](https://developers.google.com/maps/documentation/geocoding/start) can be used to geocode worldwide addresses after obtaining an API key. You will need to enter a credit card to set up a billing account with Google, which provides a $200/month credit for the first 12 months (view their [pricing structure](https://cloud.google.com/maps-platform/pricing/sheet/)). Google will not charge your card until you give permission. See <https://cloud.google.com/maps-platform/> to get started. Use [this Python script](https://cga-download.hmdc.harvard.edu/publish_web/Geocoding/google_geocoding_for_work_python_2.py) to load batches of addresses into the Google Maps Platform.
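To show what a single Geocoding API request looks like, here is a minimal sketch using only the Python standard library. It is not the CGA script linked above; the function names are illustrative, and the API key must be supplied by the caller. It assumes the JSON response format of the Geocoding API, in which each result carries a `geometry.location` object with `lat` and `lng`.

```python
import json
import urllib.parse
import urllib.request

GEOCODE_URL = "https://maps.googleapis.com/maps/api/geocode/json"


def build_url(address, api_key):
    """Build the request URL, URL-encoding the address and key."""
    query = urllib.parse.urlencode({"address": address, "key": api_key})
    return f"{GEOCODE_URL}?{query}"


def parse_response(payload):
    """Return (lat, lng, formatted_address) for the first result, or None."""
    if payload.get("status") != "OK" or not payload.get("results"):
        return None
    top = payload["results"][0]
    location = top["geometry"]["location"]
    return location["lat"], location["lng"], top.get("formatted_address")


def geocode(address, api_key, timeout=10):
    """Send one request to the API and parse the first result."""
    url = build_url(address, api_key)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return parse_response(json.load(resp))
```

Calling `geocode("1737 Cambridge St, Cambridge, MA", api_key)` with a valid key would return a coordinate tuple, or `None` when no match is found. For batches, loop over the addresses and keep the request rate within the quota on your account.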

**Local Geocoding Options:** If addresses cannot be loaded onto a server due to HIPAA compliance or other restrictions, then the online Esri and Google geocoders cannot be used. To overcome this restriction, local geocoders are available through Esri's StreetMap Premium data source. These can be downloaded for use on a desktop PC running the Windows operating system and ArcGIS Pro software. The geocoders are organized by world region. To obtain one, contact the CGA using our [contact form](/contactus) to receive a download link for the requested local geocoder and a license to run it. After downloading to your PC, follow [these instructions](https://docs.google.com/document/d/1iz4SP7E-VH0vj-BLcchHDZ9bPh7clm89Wdv6mlJxX4E/edit?usp=sharing) to use the local geocoder.

The [DeGAUSS geocoder](https://degauss.org/) is another good option for geocoding U.S. data on one's local system. DeGAUSS runs on Windows, Linux, and Mac operating systems. See also the very useful [address string formatting tips](https://github.com/degauss-org/degauss-org.github.io/wiki/Geocoding-with-DeGAUSS#address-string-formatting) on the DeGAUSS website.

**Geocoding Big Data:** The Esri local geocoders can process millions of records per hour if optimized. The optimization suggestions below are courtesy of [Adam Travis](https://sociology.fas.harvard.edu/people/adam-travis):

- Make sure the locator is loaded onto a local drive, not a network drive.
- Use the singular locators such as "USA\_PointAddress.loc" instead of the composite locator "USA.loc".
- Set the number of threads allocated to 4.
- Limit the number of candidates to 10.
- Set a minimum match score greater than 85.

**More on Geocoding:** The ability to assign specific geographic locations to textual information (the process known as geocoding) is available to anyone with a computer and internet access.

Geocoded locations expressed as latitude and longitude coordinates can be obtained one at a time in web maps such as Bing Maps or Google Maps (right click anywhere on the map and choose "What's here"). The relative ease of geocoding and the resulting accuracy can vary widely depending on a number of factors. What is the nature of the data? How ‘clean’ is it and what format is it in? What geocoding technique will be used? Determining the geocoding strategy that best suits a particular need is not always straightforward.

The process of geocoding begins with comparing data in text or tabular form to a reference data table in geographic format. The reference table is a dataset that has already been mapped, with established map coordinates. When matches between the input data and the reference data are found, the corresponding map coordinates are assigned from the reference data to the input features, thus geocoding them. A geocoding service (also called an address locator) is a program that allows a user to input a batch of data contained in a table, search for matches against a reference table, and output the result in a map or GIS layer format. The key to confidently geocoding data lies in understanding the reference table against which the data is matched, how a match is found, and the resulting spatial accuracy.
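The matching process described above can be illustrated with a toy example. This is a simplified sketch, not how any particular geocoding service is implemented: the reference table, its coordinates, and the abbreviation list are all made up for illustration, and matching is exact after normalization, whereas real geocoders also score partial matches.

```python
import re

# Hypothetical reference table: normalized address -> (latitude, longitude).
# The addresses and coordinates are invented for this example.
REFERENCE = {
    "100 MAIN ST": (42.0001, -71.0001),
    "25 ELM AVE": (42.0102, -71.0203),
}

# Illustrative street-type abbreviations applied to input addresses.
ABBREVIATIONS = {"STREET": "ST", "AVENUE": "AVE", "ROAD": "RD"}


def normalize(address):
    """Uppercase, strip punctuation, collapse whitespace, abbreviate street types."""
    cleaned = re.sub(r"[^\w\s]", "", address.upper())
    tokens = cleaned.split()
    return " ".join(ABBREVIATIONS.get(token, token) for token in tokens)


def geocode_table(rows, reference=REFERENCE):
    """Return one result dict per input address, with coordinates when matched."""
    results = []
    for address in rows:
        coords = reference.get(normalize(address))
        results.append({
            "input": address,
            "matched": coords is not None,
            "lat": coords[0] if coords else None,
            "lon": coords[1] if coords else None,
        })
    return results
```

Here `geocode_table(["100 Main Street", "9 Unknown Rd"])` matches the first address and leaves the second unmatched, which shows why both the cleanliness of the input and the coverage of the reference table determine the match rate.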



 



 

See also: [Services](/page-type/services)