Skip to content

A few scripts to pull IATI data from the web, based on the information in iatiregistry.org, and using the CKAN API

License

Notifications You must be signed in to change notification settings

IATI/IATI-Registry-Refresher

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

66 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

IATI Registry Refresher

License: MIT

Introduction

This small aplication allows you to query the CKAN implementation at iatiregistry.org to find all 'end point' urls of data recorded on the registry, and to then download that data.

The registry holds records about where data can be found on the interent.

The application is basically 2 scripts that you run one after the other.

grab_urls.php queries the registry and creates a text file for each group on the registry of all url end points of IATI data files

fetch_data.sh uses wget to pull all the data from those url text files and deposit them in their own directory.

Requirements

IATI Registry Refresher requires PHP version 5.2.0 or later.

It also requires curl. on Ubuntu

sudo apt-get install curl
sudo apt-get install php5-curl php5-cli

Installation and usage

Place all files in the same directory. Create an empty directories called urls, data and ckan

mkdir urls data ckan

From a terminal, use php-cli (command line interface) to run:

php grab_urls.php

(if you want to set up your own paths, copy this file to e.g. grab_my_urls.php and edit the paths.) This gives the data endpoints for all the files in the IATI registry (see http://iatiregistry.org/)

Run fetch_data.sh to get all the data. (In a terminal type ./fetch_data.sh) (if you want to set up your own paths, copy this file to e.g. fetch_my_urls.sh and edit the paths.)

Creating a git data snapshot

The code in git.sh can be used to update a git repository (in the data directory) with a new commit each time it is run. The IATI Tech Team maintains a git repository with nightly snapshot commits, but it is not public, see https://github.com/IATI/IATI-Stats#getting-some-data-to-run-stats-on

Wget Caveats

If your copy of wget is compiled against an old version of gnutls, then some https downloads will fail. Please make sure your system has the latest version of gnutls installed.

Bugs, issues and feature requests

If you find any bugs, note any issues or have any feature requests, please report them at https://github.com/caprenter/IATI-Registry-Refresher

Licence

Copyright 2012 caprenter <[email protected]>
     
This file is part of IATI Registry Refresher.
     
IATI Registry Refresher is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.
    
IATI Registry Refresher is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.
    
You should have received a copy of the GNU General Public License
along with IATI Registry Refresher.  If not, see <http://www.gnu.org/licenses/>.

About

A few scripts to pull IATI data from the web, based on the information in iatiregistry.org, and using the CKAN API

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published