View

The online Data Package viewer app gives you a human-friendly view of a Data Package in seconds.

Create

The online datapackage.json maker creates the datapackage.json file needed to turn data into a Data Package.
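For readers who want to see what such a file contains, here is a minimal sketch in Python of building a descriptor by hand. It is not the code behind the online maker. The helper name make_descriptor, the package name, the CSV path and the field names are all hypothetical, and every field is given the placeholder type "string".

```python
import json


def make_descriptor(name, csv_path, field_names):
    """Build a minimal descriptor dict describing one CSV resource.

    Hypothetical helper for illustration; every field is typed as
    "string" as a placeholder, which real data would usually refine.
    """
    return {
        "name": name,
        "resources": [
            {
                "path": csv_path,
                "schema": {
                    "fields": [
                        {"name": field, "type": "string"}
                        for field in field_names
                    ]
                },
            }
        ],
    }


# Example values are made up for this sketch.
descriptor = make_descriptor(
    "example-package", "data/example.csv", ["date", "value"]
)

# Serialize to the text that would be saved as datapackage.json.
descriptor_json = json.dumps(descriptor, indent=2)
print(descriptor_json)
```

Saving the printed text as datapackage.json next to the CSV file would give a small package that the viewer and validator above could then be tried on.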

Validate

The online validator checks that your datapackage.json and Data Package are good to go.


Using Tabular Data Packages With ...

These "Using with" examples usually require Tabular Data Packages, where the data in the Data Package is stored as CSV (most of the core datasets on this website are in this format).
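Because the data is plain CSV described by datapackage.json, a package can be read with nothing more than a standard library. The sketch below uses Python and assumes a package directory holding datapackage.json plus a CSV file at the path named in the first resource. The function name read_rows and the sample data are hypothetical, and the demo writes its own tiny package to a temporary directory so that it runs on its own.

```python
import csv
import json
import os
import tempfile


def read_rows(package_dir, resource_index=0):
    """Return the rows of one CSV resource as a list of dicts.

    Assumes the resource's "path" is relative to the package directory.
    """
    descriptor_path = os.path.join(package_dir, "datapackage.json")
    with open(descriptor_path, encoding="utf-8") as f:
        descriptor = json.load(f)

    resource = descriptor["resources"][resource_index]
    csv_path = os.path.join(package_dir, resource["path"])
    with open(csv_path, newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))


# Build a throwaway package (made-up data) so the sketch is runnable.
with tempfile.TemporaryDirectory() as package_dir:
    with open(os.path.join(package_dir, "datapackage.json"), "w", encoding="utf-8") as f:
        json.dump({"name": "demo", "resources": [{"path": "data.csv"}]}, f)
    with open(os.path.join(package_dir, "data.csv"), "w", newline="", encoding="utf-8") as f:
        f.write("date,value\n2020-01-01,1.5\n2020-01-02,2.5\n")

    rows = read_rows(package_dir)

print(rows)
```

All values come back as strings here. Converting them according to the types declared in the resource's schema is left out of this sketch.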

Using with Excel

For Excel you can just open the CSV file! We're also working on a macro – see this issue for details.

Using with Google Spreadsheets

Google Spreadsheet Import (in progress)

Using with R and R Client

Using with Postgresql

There is a Python script (with no dependencies) to load a Tabular Data Package into Postgresql.

As an alternative you can use the datapak gem (in Ruby w/ ActiveRecord) to load Tabular Data Packages into Postgresql.

Using with SQLite

There is a simple Python script (no dependencies) to load a Tabular Data Package into SQLite.

As an alternative you can use the datapak gem (in Ruby w/ActiveRecord) to load Tabular Data Packages into SQLite.
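To give a feel for what loading CSV data into SQLite involves, here is a minimal dependency-free sketch in Python. It is not the script mentioned above. The function names load_csv and quote_ident, the table name and the sample data are hypothetical, and every column is stored as TEXT rather than typed from the package's schema.

```python
import csv
import io
import sqlite3


def quote_ident(name):
    """Quote a table or column name for use in SQLite statements."""
    return '"' + name.replace('"', '""') + '"'


def load_csv(conn, table, csv_file):
    """Create `table` from the CSV header row and insert all data rows.

    Every column is created as TEXT; returns the number of columns.
    """
    reader = csv.reader(csv_file)
    header = next(reader)

    columns = ", ".join(quote_ident(h) + " TEXT" for h in header)
    conn.execute(f"CREATE TABLE {quote_ident(table)} ({columns})")

    placeholders = ", ".join("?" for _ in header)
    conn.executemany(
        f"INSERT INTO {quote_ident(table)} VALUES ({placeholders})", reader
    )
    conn.commit()
    return len(header)


# In-memory database and made-up CSV content keep the sketch self-contained.
conn = sqlite3.connect(":memory:")
sample = io.StringIO("date,value\n2020-01-01,1.5\n2020-01-02,2.5\n")
column_count = load_csv(conn, "example", sample)

rows = conn.execute("SELECT date, value FROM example ORDER BY date").fetchall()
print(rows)
```

For a real package, the open CSV file of a resource would be passed in place of the in-memory sample, and the column types could be taken from the schema in datapackage.json.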

Using with MATLAB

A function to read data from a Tabular Data Package is available for download from MATLAB Central's File Exchange. To contribute to the library, see the project's GitHub repository.

Using with SQL Server

There is a BIML project that uses datapackage.json to generate SSIS packages that can load the contents of a Tabular Data Package into a SQL Server database. Find out more about SQL Server Integration Services (SSIS).


Libraries

datapackage-* (nodejs)

There is a comprehensive set of NodeJS libraries which are "officially" supported.

datapackage (python module)

A module for loading and managing data packages in Python. The module is available on PyPI, so it can be installed using:

pip install datapackage

The source code is available under the GPL.

R Library

See "Using with R and R Client" above.

MATLAB Library

See "Using with MATLAB" above.

Ruby Library

Ruby library for parsing and validating both data packages and tabular data packages.

You can use the datapak gem to work with tabular data packages: it lets you download, load, or query datasets using SQL via ActiveRecord, so it works with any SQL database (it defaults to an in-memory SQLite database).


Command Line

Data Package Manager (dpm)

Data Package Manager (dpm) is a comprehensive command line tool.


Other Tools

Data Validation

  • Good Tables validates tabular data against a JSON Table Schema and gives actionable information for fixing errors. It is available as both a Python library and an online web service.
  • CSVLint

Data Package Viewer

Have a new tool to add to this page? Please let us know by opening an issue or editing this page (see the link above).

Related Projects at Open Knowledge