Skip to content

Preprocessing US Census TIGER data for Nominatim geocoder

License

Notifications You must be signed in to change notification settings

osm-search/TIGER-data

Repository files navigation

US TIGER address data for Nominatim

Convert TIGER/Line dataset of the US Census Bureau to CSV files which can be imported by Nominatim. In Nominatim the created tables are separate from OpenStreetMap tables and get queried at search time separately.

The dataset gets updated once per year. Downloading is prone to be slow (can take a full day) and converting them can take hours as well. There's a mirror on https://downloads.opencagedata.com/public/

Replace '2021' with the current year throughout.

  1. Install the GDAL library and python bindings and the unzip tool

    # Ubuntu:
    sudo apt-get install python3-gdal python3-pip unzip
    pip3 install -r requirements.txt
  2. Get the TIGER 2023 data. You will need the EDGES files (3,235 zip files, 11GB total).

    wget -r ftp://ftp2.census.gov/geo/tiger/TIGER2023/EDGES/
    
  3. Convert the data into CSV files. Adjust the file paths in the scripts as needed

    ./convert.sh

  4. Maybe: package the created files

    tar -czf tiger2023-nominatim-preprocessed.csv.tar.gz tiger

US Postcodes

Addtionally create a us_postcodes.csv.gz file with centroid coordinates.

cat tiger/*.csv | ./calculate_postcode_centroids.py | gzip -9 > us_postcodes.csv.gz

License

The source code is available under a GPLv2 license.

About

Preprocessing US Census TIGER data for Nominatim geocoder

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published