GovLab has found that large civic datasets are hard to use on the two most common data portals in the US, CKAN and Socrata. The portals' online tools don't support many important statistical or SQL functions. It's unclear where to find related datasets essential to analysis, and then stitching these sets back together and importing them into a relational database takes time and research.
To make this data more accessible, we're proud to announce a new tool: Docker for Data.
Docker for Data is simple. It's a one-step download that fires up a Postgres database ready to go with big civic datasets in just a few minutes.
Behind the scenes, it's just a Docker container with the data inside. We'd love it if you could try it out, and try to run your own.
Any and all comments are welcome. We're currently working in a few directions:
- Moving out visualization to separate containers. Tools like splunk can do a better job than we can.
- Unbinding the data from the main container, to allow for "data a-la-carte" (only download the datasets you care about.)
- Provide a sustainable hosted solution, so that people can spin up their own Docker4Data without having to setup DigitalOcean, AWS, or boot2docker.