Geographic Information Systems Asked by Lincoln Mullen on February 21, 2021
I have a shapefile with several attributes, for example "YEAR"
, "COUNTY"
, and "AREA"
. I also have a CSV file with more fields that I want in the shapefile, such as "POPULATION"
. Both the shapefile and the CSV file have a field "GISJOIN"
.
I know how to do a join in QGIS. But how can I make a permanent join and write to a shapefile using ogr2ogr
or one of the other tools in GDAL/OGR?
The ogr2ogr utility supports a limited sql syntax. You can join your CSV to the shapefile using something like the following:
ogr2ogr -sql "select inshape.*, joincsv.* from inshape left join 'joincsv.csv'.joincsv on inshape.GISJOIN = joincsv.GISJOIN" shape_join.shp inshape.shp
Correct answer by user2856 on February 21, 2021
The accepted answer is really useful, but I found that it was slow with a large-ish database. I believe it also limits your options when joining the data.
My method now is to pull everything into SQLite (using a combination of csvkit and ogr2ogr):
csvsql --db sqlite:///myjoindb.db --insert myjoincsv.csv
ogr2ogr -append -f "SQLite" myjoindb.db myjoinshp.shp
Then join everything and create a shapefile out of it:
ogr2ogr -f "ESRI Shapefile" -sql "SELECT csv.*, shp.* FROM myjoinshp shp INNER JOIN myjoincsv csv ON csv.joinfield = shp.joinfield" joined_output.shp myjoindb.db
Answered by eric brelsford on February 21, 2021
Based on Eric's answer, this is an approach using the SQLite command instead of csvkit. If you're not already starting from a GeoPackage, convert to it first:
ogr2ogr -f GPKG myjoinshp.gpkg myjoinshp.shp
Then, exploiting the fact that a GeoPackage is actually an SQLite database, import the CSV as a new table (named myjoincsv
), which modifies the existing GeoPackage:
sqlite3 myjoinshp.gpkg --cmd '.mode csv' '.import myjoincsv.csv myjoincsv'
This step could also be done in a GUI, e.g. DB Browser for SQLite supports importing a CSV into the loaded database by drag and drop.
Finally, make a new GeoPackage or shapefile, using a join, as in Eric's answer:
ogr2ogr -f GPKG joined_output.gpkg myjoinshp.gpkg -sql "SELECT shp.*, csv.* FROM myjoinshp shp JOIN myjoincsv csv ON shp.joinfield = csv.joinfield"
Answered by Socci on February 21, 2021
Get help from others!
Recent Answers
Recent Questions
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP