California/Using CPAD data
Introduction
The California Protected Areas Database (CPAD) data published by GreenInfo Network, downloadable from here and noted in our Contributors page, are slowly but surely being incorporated into OSM: not as an import, but curated more or less one polygon at a time, case by case. These data might be almost anywhere in the state, but curated CPAD data are now (2019 and later) found in Santa Cruz, San Mateo and Santa Clara counties. CPAD covers all lands in California that are protected as permanent open space: 15,000 parks and preserves managed by over 1,000 agencies and organizations and owned by the people of the state of California.
Data format
CPAD is made available as three kinds of data agglomerations: SuperUnits (high-level), Units (medium) and Holdings (fine-grained, almost at an "individual parcel" level). See the CPAD data pages to determine which one is appropriate for your use. SuperUnits are probably best for an initial "wide scale" view of the data, so start there. If OSM needs greater detail, the Units and Holdings shapefiles can be used. Each of these is a zipped shapefile. To open one with JOSM, ensure that its opendata plugin is installed, optionally unzip the "data glom" you will use, and "open file" on either the zipped data file or the unzipped shapefile (the file ending in .shp).
An example: from CPAD 2020b, here is a typical single-item datum (part of an Open Space Preserve in Santa Cruz) as the various tags found on the polygon:
ACCESS_TYP=Open Access
ACRES=492.2348748251647
AGNCY_ID=1320
AGNCY_LEV=City
AGNCY_NAME=Santa Cruz, City of
AGNCY_TYP=City Agency
AGNCY_WEB=http://www.cityofsantacruz.com/departments/parks-recreation
CITY=Santa Cruz
COUNTY=Santa Cruz
DATE_REVIS=2006-01-31
DES_TP=Local Conservation Area
GAP_STS=2
HOLDING_ID=2298
LABEL_NAME=Pogonip OSP
LAND_WATER=Land
LAYER=City
MNG_AGNCY=Santa Cruz, City of
MNG_AG_ID=1320
MNG_AG_LEV=City
MNG_AG_TYP=City Agency
PARK_URL=http://www.cityofsantacruz.com/departments/parks-recreation/parks/open-spaces/pogonip
SITE_NAME=Pogonip Open Space Preserve
SRC_ALIGN=PARCELS
SUID_NMA=12041
UNIT_ID=18677
UNIT_NAME=Pogonip Open Space Preserve
YR_EST=0
YR_PROTECT=0
That's a lot of data! Most of these are "internal to CPAD" and don't map logically to any specific OSM tag. Some, however, do. Others, especially UNIT_ID and ACRES, are useful to keep in the data: UNIT_ID provides a unique identifier from CPAD, and ACRES serves as a sort of "checksum" of the data. That is, a particular datum might change from one CPAD version to the next, but the change might be so small or subtle that it isn't readily apparent or easy to spot visually. If the datum does change between CPAD versions, though, the ACRES value will almost certainly change too, at least in one of its many digits of precision.
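The "checksum" idea can be sketched in a few lines of Python. This is only an illustration, not part of any CPAD or OSM tooling: the function name `changed_units`, the dictionaries mapping UNIT_ID to ACRES, the second ACRES value for unit 18677 and the unit "100" are all invented for the example.

```python
def changed_units(old_acres, new_acres):
    """Return the UNIT_IDs whose ACRES value differs between two CPAD releases.

    Both arguments are hypothetical dicts mapping UNIT_ID -> ACRES (as strings),
    for example built from the attribute tables of two CPAD versions.
    """
    changed = []
    for unit_id, old_value in old_acres.items():
        new_value = new_acres.get(unit_id)
        # Compare as strings so that a difference in any digit counts as a change.
        if new_value is not None and new_value != old_value:
            changed.append(unit_id)
    return sorted(changed)


# Invented sample data: only the first ACRES value comes from the example above.
old = {"18677": "492.2348748251647", "100": "12.5"}
new = {"18677": "492.2348748251651", "100": "12.5"}
print(changed_units(old, new))
```

Units that appear in only one of the two releases are ignored here; a fuller comparison would probably want to report those separately.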
Generally, area sizes from shapefiles should not be incorporated into OSM. Please see the Talk page for discussion about this.
Suggested tag-migration protocol
So, which tags to keep, which to delete? Here are suggestions:
CPAD Tag | Action | New OSM key=value |
---|---|---|
ACCESS_TYP | Keep | Change to access=*. If "Open Access", set value "yes"; if "Restricted Access", set value "private"; if "No Public Access", set value "no"; if "Unknown", set value "unknown". |
ACRES | Special | Change to cpad:acres=* with the same value. If multiple "sub-units" are coalesced to form a larger unit, delete these differing ACRES tags. |
AGNCY_ID | Delete | |
AGNCY_LEV | Keep | Change to ownership=*. If "Federal", set value "national"; if "State", set value "state"; if "County", set value "county"; if "City", set value "city". |
AGNCY_NAME | Keep | Change to owner=* with the same value |
AGNCY_TYP | Special | See DES_TP, below, to possibly preserve its value, then delete this |
AGNCY_WEB | Special | Change to website=* with the same value, unless there is a PARK_URL which differs (if so, delete this tag and see PARK_URL). If empty, delete. |
CITY | Delete | |
COUNTY | Delete | |
DATE_REVIS | Delete | |
DES_TP | Optional | If you wish, and it distinctly adds clarity to "what this is," change to description=* with the same value. If you include the value of AGNCY_TYP here, too, semi-colon separate the values. |
GAP_STS | Delete | |
HOLDING_ID | Delete | |
LABEL_NAME | Delete | |
LAND_WATER | Delete | |
LAYER | Delete | |
MNG_AGNCY | Keep | Change to operator=* with the same value |
MNG_AG_ID | Delete | |
MNG_AG_LEV | Delete | |
MNG_AG_TYP | Delete | |
PARK_URL | Special | Change to website=* with the same value. If this value is empty, use the value of AGNCY_WEB for website=* instead. |
SITE_NAME | Delete | |
SPEC_USE | Delete | |
SRC_ALIGN | Delete | |
SRC_ATTR | Delete | |
SUID_NMA | Delete | |
UNIT_ID | Special | Change to cpad:unitid=* with the same value. If multiple "sub-units" are coalesced to form a larger unit, delete these differing UNIT_ID tags. |
UNIT_NAME | Keep | Change to name=* with the same value |
YR_EST | Delete | |
YR_PROTECT | Special | If 0 (zero), delete. If non-zero, change to start_date=* with the same value |
Add an additional OSM-specific "physical" tag, like leisure=nature_reserve or boundary=protected_area (plus the proper protect_class=* value, if it is correct to add one). The choice of which tag(s) to add is at the discretion of the author, but do strive to tag consistently with documented uses. See United_States/Public_lands for additional suggestions: these are intended to follow regularized tagging protocols as documented there.
Applying these suggestions results in the following OSM tags on the example polygon from above:
access=yes
description=Local Conservation Area
leisure=nature_reserve
name=Pogonip Open Space Preserve
operator=Santa Cruz, City of
owner=Santa Cruz, City of
ownership=city
website=http://www.cityofsantacruz.com/departments/parks-recreation/parks/open-spaces/pogonip
In this specific example, other sub-unit polygons were coalesced together to form a larger cohesive unit, so there are neither cpad:unitid=18677 nor cpad:acres=492.2348748251647 tags. If this were an isolated (multi)polygon, there would be these tags.
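As a rough illustration, the suggested tag migration can be written as a small Python function. This is a sketch under stated assumptions, not an official conversion tool: the name `migrate_tags` and its parameters `coalesced`, `keep_description` and `physical` are hypothetical, values of ACCESS_TYP or AGNCY_LEV that the table does not mention are simply skipped, and the optional merging of AGNCY_TYP into description=* is left out.

```python
# Value mappings taken from the ACCESS_TYP and AGNCY_LEV rows of the table.
ACCESS = {"Open Access": "yes", "Restricted Access": "private",
          "No Public Access": "no", "Unknown": "unknown"}
OWNERSHIP = {"Federal": "national", "State": "state",
             "County": "county", "City": "city"}


def migrate_tags(cpad, coalesced=False, keep_description=True,
                 physical=("leisure", "nature_reserve")):
    """Translate a dict of CPAD attributes into suggested OSM tags.

    All parameter names are hypothetical. `coalesced` should be True when
    several sub-units were merged, so UNIT_ID and ACRES are dropped.
    """
    osm = {}
    if cpad.get("ACCESS_TYP") in ACCESS:
        osm["access"] = ACCESS[cpad["ACCESS_TYP"]]
    if cpad.get("AGNCY_LEV") in OWNERSHIP:
        osm["ownership"] = OWNERSHIP[cpad["AGNCY_LEV"]]
    # "Keep" rows whose value is copied unchanged.
    for src, dst in (("AGNCY_NAME", "owner"), ("MNG_AGNCY", "operator"),
                     ("UNIT_NAME", "name")):
        if cpad.get(src):
            osm[dst] = cpad[src]
    # PARK_URL wins; AGNCY_WEB is only used when PARK_URL is empty.
    website = cpad.get("PARK_URL") or cpad.get("AGNCY_WEB")
    if website:
        osm["website"] = website
    # DES_TP is optional in the table, so it is controlled by a flag here.
    if keep_description and cpad.get("DES_TP"):
        osm["description"] = cpad["DES_TP"]
    # YR_PROTECT of 0 (or empty) is deleted; anything else becomes start_date.
    year = str(cpad.get("YR_PROTECT") or "0").strip()
    if year not in ("", "0"):
        osm["start_date"] = year
    if not coalesced:
        for src, dst in (("ACRES", "cpad:acres"), ("UNIT_ID", "cpad:unitid")):
            if cpad.get(src):
                osm[dst] = str(cpad[src])
    # The "physical" tag is the mapper's choice; this default is an assumption.
    key, value = physical
    osm[key] = value
    return osm


pogonip = {
    "ACCESS_TYP": "Open Access", "ACRES": "492.2348748251647",
    "AGNCY_LEV": "City", "AGNCY_NAME": "Santa Cruz, City of",
    "AGNCY_WEB": "http://www.cityofsantacruz.com/departments/parks-recreation",
    "DES_TP": "Local Conservation Area", "GAP_STS": "2",
    "MNG_AGNCY": "Santa Cruz, City of",
    "PARK_URL": "http://www.cityofsantacruz.com/departments/parks-recreation/parks/open-spaces/pogonip",
    "UNIT_ID": "18677", "UNIT_NAME": "Pogonip Open Space Preserve",
    "YR_EST": "0", "YR_PROTECT": "0",
}
for key, value in sorted(migrate_tags(pogonip, coalesced=True).items()):
    print(f"{key}={value}")
```

Called with `coalesced=True`, the Pogonip attributes should yield the same eight tags as listed above; with the default `coalesced=False`, cpad:acres=* and cpad:unitid=* are added as well.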
Some might explicitly add source=CPAD 2020b to each datum, but it is a more modern convention to enter this on the entirety of the changeset during upload (JOSM has "Specify the data source for the changes"), where this may simply be set to "CPAD 2020b" (for example).
Extraneous data found in OSM
Sometimes shapefile tags in ALL CAPITAL LETTERS are left in the data when they do not map logically to OSM tags. Where objectionable, these tags can be deleted from OSM. However, for the reasons indicated above, please leave (older) ACRES and UNIT_ID tags intact or, at most, change their keys to cpad:acres=* and cpad:unitid=*, respectively, leaving their values unchanged.
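A cleanup pass of this kind might look like the following Python sketch. It is an assumption-laden illustration only: `clean_leftover_tags` is a hypothetical helper, and it removes only the CPAD keys listed in the table above rather than every upper-case key, so that genuine OSM keys written in capitals (such as FIXME) are left alone.

```python
# CPAD attribute names from the tag-migration table, minus ACRES and UNIT_ID.
CPAD_KEYS = {
    "ACCESS_TYP", "AGNCY_ID", "AGNCY_LEV", "AGNCY_NAME", "AGNCY_TYP",
    "AGNCY_WEB", "CITY", "COUNTY", "DATE_REVIS", "DES_TP", "GAP_STS",
    "HOLDING_ID", "LABEL_NAME", "LAND_WATER", "LAYER", "MNG_AGNCY",
    "MNG_AG_ID", "MNG_AG_LEV", "MNG_AG_TYP", "PARK_URL", "SITE_NAME",
    "SPEC_USE", "SRC_ALIGN", "SRC_ATTR", "SUID_NMA", "UNIT_NAME",
    "YR_EST", "YR_PROTECT",
}
# Keys that are renamed instead of deleted, with their values left intact.
RENAME = {"ACRES": "cpad:acres", "UNIT_ID": "cpad:unitid"}


def clean_leftover_tags(tags):
    """Return a copy of `tags` without leftover CPAD shapefile attributes."""
    cleaned = {}
    for key, value in tags.items():
        if key in RENAME:
            # An existing cpad:* tag takes precedence over the old key.
            cleaned.setdefault(RENAME[key], value)
        elif key in CPAD_KEYS:
            continue  # leftover shapefile attribute: drop it
        else:
            cleaned[key] = value
    return cleaned


sample = {"name": "Pogonip Open Space Preserve", "ACRES": "492.2348748251647",
          "UNIT_ID": "18677", "GAP_STS": "2", "FIXME": "check boundary"}
print(clean_leftover_tags(sample))
```

This sketch deletes tags unconditionally; before removing anything from live data, it is worth checking whether a leftover value (UNIT_NAME, for instance) should first be migrated to its OSM equivalent.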