What We Are Doing

Standards & Patterns

A small set of lightweight 'data package' standards and patterns providing a base structure on which tooling and integration can build.

Tooling & Integration

Making it easy to use and publish data packages from your existing apps and workflows, whether that's Excel, R, or Hadoop!

Outreach & Community

Engaging and evangelizing around the concepts, standards, and tooling, and building a community of users and contributors.

Find out more about our Vision

There's too much friction working with data: friction getting data, friction processing data, friction sharing data.

This friction stops people: stops them creating, sharing, collaborating with data.

It stops the cycle of find, improve, share that would make for a dynamic and productive data ecosystem.

We need to make an open data ecosystem that, like open-source for software, is useful and attractive to those without any principled interest, the vast majority who simply want the best tool for the job, the easiest route to their goal.



Our Key Principles

1 Focus

Have a tight focus on one (small) part of the data chain, one specific feature (packaging), a few specific types of data (e.g. tabular).

2 The Web

Build for the web. Use formats that are web "native" (JSON) and that work naturally with HTTP (e.g. CSV, which is plain text and can be streamed). Distributed, not centralized.
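As a rough illustration of why plain-text CSV suits streaming, here is a minimal Python sketch that parses records one at a time as lines arrive. The function name `stream_rows` and the sample data are made up for this example; in practice the lines would come from an HTTP response body decoded to text.

```python
import csv
from typing import Iterable, Iterator


def stream_rows(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one dict per CSV record without loading the whole file."""
    # csv.DictReader accepts any iterable of text lines, so it can sit
    # on top of a source that produces lines incrementally.
    yield from csv.DictReader(lines)


# Made-up sample lines standing in for a streamed response body.
sample = iter(["id,label\n", "1,alpha\n", "2,beta\n"])
rows = stream_rows(sample)
first = next(rows)  # consumes only the header line and the first record
print(first)
```

Because each record is a line of text, a consumer can start working on the first rows before the rest of the file has arrived.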

3 Existing Tools

Integrate as easily as possible with existing tools both by building integrations and designing for direct use – for example we like CSV because everyone has a tool that can access CSV.

4 Minimalist

Do the least possible: for example, require only the most essential metadata (but allow for extensibility). Data should contain nothing extraneous, be it blank lines or footnotes.
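To make "essential metadata plus extensibility" concrete, here is a small Python sketch of a package descriptor. It assumes a descriptor needs only a `name` and a list of `resources`; that required set, the helper `missing_keys`, and all values are assumptions made for illustration, not the definitive requirements of the standard.

```python
import json

# Assumed minimal set of keys for this sketch only.
REQUIRED_KEYS = ("name", "resources")


def missing_keys(descriptor: dict) -> list:
    """Return the assumed-required keys that are absent from the descriptor."""
    return [key for key in REQUIRED_KEYS if key not in descriptor]


# A deliberately small descriptor (hypothetical values).
minimal = {"name": "example-package", "resources": [{"path": "data.csv"}]}

# Extra keys are allowed: the check looks only at the required ones.
extended = {**minimal, "maintainer": "hypothetical-person"}

print(json.dumps(minimal, indent=2))
print(missing_keys(extended))
```

The check ignores anything beyond the required keys, which is one simple way to keep the core small while leaving room for extensions.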

5 Simplicity

Use the most basic formats. Add the minimum, do the least needed. This is also a criterion that runs through all the other principles.

Related Projects at Open Knowledge