Import of Tenerife's hydraulic infrastructures
Import propossal for http://tareas.openstreetmap.es/project/44
Goal
The Hydrologycal Plan of Tenerife (PHT) contains a list of hydraulic infrastructures (rafts, reservoirs, dams, wells, treatment plants, etc.) that are interesting to incorporate into the map of Tenerife. The list includes for each infrastructure its name, type and coordinates of its location. With this import it is intended:
- Merge from this list the name of the infrastructures with those existing in OSM.
- Use the coordinates of the list to locate on the aerial image the elements that do not exist in OSM, draw infrastructures that correspond to ways and correct the position -in case of error- if it corresponds to nodes. New elements will be added to OSM only if they can be checked on another data source.
Schedule
This is a long-term work without strict planning. Important milestones will be noted here.
- 03-06, 2017. Discussion in the Talk-es list (in spanish).
- 03-23, 2017. Discussion in the imports list.
- 03-29, 2017. Message to Telegram room. Work begins. Publication in task manager.
- 05-24, 2017. Work done. Validation remains pending.
Import Data
Background
- Data source site: http://www.planhidrologicodetenerife.org
- Data license: Public Domain
- Link to permission: artículo 13 de la Ley de Propiedad Intelectual española
- Attribution: source=Plan Hidrológico de Tenerife in the changeset tags.
Permission
Spanish legal provisions and their corresponding projects are not subjected to copyrights according to Article 13 of the Spain Intellectual Property Law.
Data description
Documento PDF (1.5 Mb) with a list of hydraulic infrastructures. The list is divided into pages, each page contains in the header the type of Infrastructure whose value is used to define to which map feature it corresponds in OSM and apply the corresponding tags. These are the fields from the list that will be used in the import.
- Denominación (denomination): Infrastructure names.
- Estado (state): Values EX = Exist, PR = Predicted and PL = Planned. Only existing infrastructures (EX) will be used.
- X: UTM coordinate X in projection EPSG: 32628.
- Y: UTM coordinate Y in projection EPSG: 32628.
The list includes 5580 infrastructures.
Import Type
This is a one time, full manual import.
Data Preparation
Data Reduction & Simplification
After filtering only the existing infrastructures and those with coordinates, results a total of 3451 infrastructures. After eliminating the types of infrastructure not selected for import (see below) results a total of 2478 infrastructures.
Tagging Plans
Each element will have the following tags:
Original field | OSM tag | OSM value | Comment |
---|---|---|---|
Denominación | name=* + alt_name=* | * | In upcase |
X, Y | <node lat=* lon=*> | Geographic coordinates | In EPSG:32628 projection |
The value of Tipo de Infraestructura (type of infraestructure) in the page header is used to apply tags acording to this table:
Type of infraestructure | Count | OSM element | Tags | Comment | Example |
---|---|---|---|---|---|
- PRESA - BALSA - BALSA DE AGUA REGENERADA |
146 | Area | landuse=reservoir reservoir_type=water_storage |
Not covered water storage infrastructures. | 28975979 28975979 |
- DEPÓSITO ABASTECIMIENTO - DEPOSITO CUBIERTO DE AGUA REGENERADA - DEPÓSITO DE CABECERA AGUA DESALADA - DEPÓSITO INTERNO EDAM |
538 | area | building=yes man_made=reservoir_covered content=water |
Covered water storage infrastructures. | 208839002 208839002 |
- CENTRAL DE TURBINADO | 2 | area | building=yes | Hydroelectric power plant. | 476095077 476095077 |
- ESTACIÓN DE BOMBEO AGUA DESALADA - ESTACIÓN DE BOMBEO ABASTECIMIENTO - ESTACIÓN DE BOMBEO DE AGUAS REGENERADAS |
12 | area | building=yes | Water pumping infraestructure. | 476095088 476095088 |
- EBAR | 109 | area | building=yes | Waste water pumping infraestructures. | 476095086 476095086 |
- EDAM - EDAS |
49 | area | landuse=industrial man_made=water_works |
Infraestructures to transform sea water (EDAM) or brakish underground water (EDAS). | 461901634 461901634 |
- ERA - EDAR - ETAR - ESTACIÓN DE TRATAMIENTO PREVIO |
101 | area | landuse=industrial man_made=wastewater_plant |
Infraestructures related to waste water treatment. | 189876063 189876063 |
- POZO CONVENCIONAL - POZO SONDEO |
397 | Nodo | man_made=water_well | Water well for underground water catchment with a vertical drilling. | 2469328641 2469328641 |
- GALERÍA CONVENCIONAL - GALERÍA NACIENTE - GALERÍA POZO - GALERÍA SOCAVÓN |
1124 | Nodo | man_made=adit resource=water |
Man made horizontal adit to capture underground water. | 4697452924 4697452924 |
- MANANTIAL - POZO DE CAPTACIÓN DE AGUA DE MAR - POZO ORDINARIO - TOMADERO - CÁMARA DE MEZCLA - ESTACIÓN METEOROLÓGICA - SONDEO INVESTIGACIÓN - PUNTOS DE CONEXIÓN - PUNTO DE SUMINISTRO DE AGUA DESALINIZADA - POZO ABSORBENTE - POZO DE VERTIDO DE SALMUERA AGUA DESALADA |
1232 | N/A | N/A | Infrastructures discarded for OSM for different reasons, mainly to be difficult to locate over aerial image. |
Changeset Tags
- source=Plan Hidrológico de Tenerife; other data sources
- type=import
- url=http://wiki.openstreetmap.org/wiki/Tenerife_streams_import
Data Transformation
- A batch script infraesturas.sh is used to:
- Converts the PDF document to text format.
- Delete headers, footers and other texts that are not part of the infrastructure data.
- Filter only existing infrastructures and those with coordinates.
- Transform names by correcting the use of capital letters, spelling, abbreviations.
- Separate the main name and the alternative ones if any.
- Put the initial article at the beginning when they are at the end in parentheses.
- The generic Deposit, Presa, Gallery, is added at the beginning of the name, depending on the case.
- Convert the coordinates and reproject to EPSG: 4326.
- Eliminates the infrastructures of discarded types.
- Generates an XML OSM file assigning to each element tags according to the tagging plan.
cuadricula.py: generates a geojson file with a 2 km2 grid to split the data in tasks.
split.py: generats OSM files with the data splited by tasks.
Data Transformation Results
The software used and its results are hosted in this repository
- infraestructuras.csv: Sheet with the infraestructures data.
- infraestructuras.osm: Full data set to import.
- task/task???.osm: Data set splited in 449 files.
Workflow
Team Approach
Import will be undertaken by experienced OSM volunteers, following a strict workflow.
Tools
- The JOSM editor.
- The spanish tasking manager server.
Preliminary steps
- Create an account like usuario_imports with an email not already registered in OSM. If you use gmail, you can use your usual address adding "+imports" before the 'at'. Example: peter+imports@gmail.com
- Put this account in JOSM settings.
- Enable the remote control in JOSM: Settings (F12) > Remote control > Enable remote control.
- Install JOSM/Plugins/TODO_list.
- Download the desired files from here.
- Give authorization to use your openstreetmap account in the tasking manager in the link 'login to OpenStreetMap'.
- Select an area and clic in "Start contributing".
- Open this area in JOSM with the link 'Edit with Josm'.
Main steps
- Load the import OSM file corresponding to this area. You shouldn't upload this file.
- Select all in this layer and add to the ToDo list.
- Zoom to a node in the ToDo list. Check if it exists in the OSM data.
- If not exists and it correspond to an area draw it in the OSM layer using background aerial imagery.
- If it don't exists and it correspond to a node check the position and copy the node from the import layer to the OSM layer.
- If you can't locate the infraestructure using aerial imagery, ignore it and don't import.
- If it exists, check position and copy the name tag.
- Repeat until you clear the ToDo list.
Conflation
- If the OSM infraestructure it's not located in the position indicated by the PHT, check with aerial imagery.
- If there exists a conflict between the name in OSM and PHT, the last is the oficial data.
Last notes
- Make sure to use the user created in Josm for the import.
- Upload the edits to OSM assigning the #Changeset Tags.
- Mark the task as done in the tasking manager.
Regresion plan
In case of any trouble, JOSM/Plugins/reverter will be used.
Updates
These are static data, no updates are expected.
Quality Assurance
Each area edit can be validated in the tasking manager.