User:Randomjunk



Anarchy
I don't give a flying monkey's for tag voting, automatic changebots, endless discussions, categories, or any of that crap, but prefer to get on and actually do stuff.

OpenStreetMap isn't a computer project, it's an outdoors activity.
Less of the bulk imports and wikifiddling please.
Go outside and map!

I support OSM by being a member of the OpenStreetMap Foundation. Do you?

I support the license upgrade to ODbL v 1.0.

My OSM user is randomjunk.

Cycle Map

I'm helping Andy out with his cycle map. He uses my machine to render the tiles.


GPS points

I thrashed the OSM DB server for a while last week, and produced some interesting images of the GPS points that people have uploaded to OSM.

For example: European GPS Points and London GPS points

more images...

It would be a bit tricky to do anything for a larger area because of the difficulty in obtaining these points. The European image contains 115 million GPS points, and I had to make more than a few thousand queries to the DB to get them (spread over about 24 hours though). It would be useful to have a "Planet.gpx"... but most of the time I can't imagine it would be any use... just the odd pretty picture.

The code to render these images works by building a 2D frequency array of points and then scaling each point against the 95th percentile point to generate the image. It's a complete hack script, and I haven't tried to tidy it up at all. To run it do:

python renderPoints.py <gps_file> <width> <imagename> [<bbox>]

If you don't supply a bbox then the program makes two scans of the GPS file: the first calculates the extent, the second does the rendering. The GPS file is a simple text file with one point per line, each point being two floating point numbers separated by a tab. The file can be gzipped.
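
For the curious, here's a minimal sketch of the same frequency-array idea. It is not the actual renderPoints.py: it assumes numpy and Pillow, a "lat<TAB>lon" line order, and it skips the optional bbox and the two-scan trick.

import gzip
import sys

import numpy as np
from PIL import Image

def render(gps_file, width, imagename):
    # The real script accepts gzipped input; here we just go by the extension.
    opener = gzip.open if gps_file.endswith('.gz') else open
    with opener(gps_file, 'rt') as f:
        pts = np.array([[float(v) for v in line.split('\t')] for line in f])
    lats, lons = pts[:, 0], pts[:, 1]
    height = max(1, int(width * np.ptp(lats) / np.ptp(lons)))
    # Build the 2D frequency array: one bin per output pixel.
    freq, _, _ = np.histogram2d(lats, lons, bins=(height, width))
    # Scale against the 95th percentile of the occupied bins so a handful
    # of very dense pixels doesn't wash out the rest of the image.
    p95 = np.percentile(freq[freq > 0], 95)
    grey = (np.clip(freq / p95, 0.0, 1.0) * 255).astype(np.uint8)
    Image.fromarray(np.flipud(grey)).save(imagename)  # flip: north at top

if __name__ == '__main__':
    render(sys.argv[1], int(sys.argv[2]), sys.argv[3])

Run it the same way as the real thing: python renderPoints_sketch.py points.txt.gz 1024 out.png.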

Post codes

Using the data from the NPE maps and Free The Postcode, I created a slippy map showing the rough postcode area polygons that the data implies.

The code for this is on the page. The tiles are rendered using Mapnik.


Multi country

I was playing with rendering differently for different countries.

This is really simple:

  • take the world boundaries shape file used by the mapnik slippy map
  • import them into postgres
  • use the postgis functions to split the boundaries into tiles (this improves efficiency for the next step)
  • for every OSM line, use the PostGIS functions to figure out which country the line intersects with... if it matches several, just pick the first (a sketch of this step follows the list)
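
A hypothetical sketch of the last two steps with psycopg2 against PostGIS -- every table and column name here (world_boundaries, iso_code, planet_osm_line, country) is an assumption for illustration, not the actual schema:

import psycopg2

conn = psycopg2.connect(dbname='gis')
cur = conn.cursor()

# Pre-split the boundary polygons into smaller pieces so the intersection
# tests below are cheaper (ST_Subdivide needs PostGIS 2.2+).
cur.execute("""
    CREATE TABLE world_boundaries_split AS
    SELECT iso_code, ST_Subdivide(geom) AS geom
    FROM world_boundaries
""")

# Tag each OSM line with the first country piece it intersects; lines
# crossing a border just get whichever country comes back first.
cur.execute("""
    UPDATE planet_osm_line AS l
    SET country = (
        SELECT b.iso_code
        FROM world_boundaries_split AS b
        WHERE ST_Intersects(l.way, b.geom)
        LIMIT 1
    )
""")
conn.commit()
conn.close()

Picking the first match is why a way that crosses a border ends up rendered as one country or the other, which feeds into the problem list below.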

It has some problems:

  • the boundary polygons aren't super accurate... we've been using them for border plotting at low zoom though, so they're not that bad.
  • the Mapnik style files become completely impossible to edit... you have to add country conditions to every rule, and if you want a default style it has to explicitly exclude all the other countries you've used (I haven't done this in the above example render, and you can see the problem). It might be possible to fix this in the SQL -- you'd basically need to create a table for each country. The alternative is to improve Mapnik's file format.

Progress

[Animation: LondonProgress.gif]

I'm currently creating weekly overview plots of various areas to demonstrate progress in OSM.

These used to be rendered using Osmarender (the London one, at least... I wasn't doing the others), but unfortunately that method suffers from an exponential increase in time and memory usage as the size of the OSM data increases. When the Greater London .svg file topped 250MB, everything broke.

So now I'm using Mapnik. The process is:

  • download the planet every week
  • import using osm2pgsql
  • use generate_image.py with a modified osm.xml style file (a driver sketch follows this list)
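
Boiled down, the weekly driver is roughly the following -- the planet URL and the database name are assumptions about my setup, not a recipe:

import subprocess

PLANET = 'planet-latest.osm.bz2'

# 1. download the planet (-N skips the fetch if the local copy is current)
subprocess.check_call(['wget', '-N',
                       'https://planet.openstreetmap.org/' + PLANET])
# 2. import into PostGIS
subprocess.check_call(['osm2pgsql', '--database', 'gis', PLANET])
# 3. render with Mapnik's generate_image.py and the modified osm.xml
subprocess.check_call(['python', 'generate_image.py'])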

This process is approximately linear with the planet size (possibly n log n... haven't really thought about it much) -- most of the time is spent importing the planet (about 5 hours). Anyway it renders the entire USA in about 5 minutes... so it's not slow once the data is imported.


Osmarender way

For anybody interested in how File:LondonProgress.gif was rendered (and hence the above animation), get ready to be scared to death by the hackery:

First off, the data is grabbed using the following Python script, which downloads it bit by bit in "tiles". It's done like this so as not to stress the OSM server too much by requesting large areas (areas this big aren't allowed anyway), and to spread the requests out over an hour or two. I'd use the planet file, except that it wasn't very reliable at the time I started this, and it seemed a bit pointless downloading 350MB for just a tiny lump of it.

import urllib
import time
import os

# Top-right and bottom-left corners of the Greater London bounding box.
trlon=0.291137
trlat=51.73723
bllon=-0.57678
bllat=51.23096

lat=bllat
lon=bllon

# Walk the bounding box in 0.035 degree steps, saving each API response
# to its own file so an interrupted run can resume without re-fetching.
while lat < trlat:
	while lon < trlon:
		print "getting %f,%f" % (lat, lon)
		if os.path.exists('map%f_%f.osm' % (lon, lat)):
			print "skipping as file present"
		else:
			data = urllib.urlopen('http://<osmid>:<password>@www.openstreetmap.org/api/0.3/map?bbox=%f,%f,%f,%f' % (lon,lat,(lon+0.035),(lat+0.035))).read()
			f = open('map%f_%f.osm' % (lon, lat), 'w')
			f.write(data)
			f.close()
			print "retrieved data, sleeping to not pound OSM too much at once"
			time.sleep(10)
		lon += 0.035
	lat += 0.035
	lon = bllon

Then all the "tiles" are stuck together. This is the dirty bit! (footer and header are just files containing an XML OSM file start and end, the awk is there to strip the <bounds> elements the api now puts in):

cat header > combined
for f in *.osm
do
  awk '!/<bounds/ {print}' $f | tail --lines=+3 | head --lines=-1 >> combined
done
cat footer >> combined
mv combined osmarender/data.osm

The data.osm file is 35MB for this area.

I then used xsltproc to render to SVG with a standard Osmarender rules file, though I modified it to make the tube lines show up better and to turn on segment painting. The resulting SVG is 63MB... a bit huge, and it takes a good half hour and most of the RAM in my box to produce.

I then used rsvg to render the SVG to PNG, specifying the width as 1280 pixels. This takes just a couple of minutes and only uses a moderately stupid amount of RAM.
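
Strung together, those two steps look roughly like this. The rules-file name is an assumption, and rsvg-convert is the modern name for the rsvg command line tool:

import subprocess

# Osmarender is just an XSLT stylesheet, so the "renderer" is xsltproc;
# the rules file points at data.osm itself.
with open('london.svg', 'w') as svg:
    subprocess.check_call(
        ['xsltproc', 'osmarender.xsl', 'osm-map-features.xml'],
        stdout=svg)

# Rasterise the SVG at a fixed width of 1280 pixels.
subprocess.check_call(['rsvg-convert', '--width=1280',
                       '--output', 'london.png', 'london.svg'])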

I've been doing this once a week since mid-October 2006; you can see the results here.

At the current growth rate of mapped areas I'm going to run out of RAM in about 6 months :-( ... yup, run out of RAM... a rethink is required: going with Mapnik -- see above.