Skip to content

Project specification

Jonathan Schaeffer edited this page Mar 28, 2024 · 3 revisions

Context

Data findability in seismology is a complex task using information disseminated in different places. The aim of the project is to build a diagnostic of all the problems and inconsistencies that appears in the process of finding the data.

Typical workflow

Top down scenario

Starting from FDSN networks list

https://www.fdsn.org/networks

For each network start the test:

Does the network have a DOI ?

  • no: mark test as failed
  • yes: go to next step

Get datacite metadata for the DOI

Check if the landing page works

Check if data is open, if it has a licence

Pass a FUJ-I test for the DOI (optional)

This step is maybe long to pass, but could be very useful

store some metadata

  • the publisher
  • the licence
  • the FUJ-I score

from the routing system at FDSN

get the datacenter hosting the network

ask for the stationXML metadata at the datacenter

Check for the folowing :

  • restrictionPolicy
  • Identifier (the DOI should be the same)

Implementation

Backend

  • run the tests on a regular basis
  • store the result in a database
  • expose all objects through an API

Interesting tools for this:

  • django
  • celery

Frontend

  • web ui to expose:
  • all the tests runs
  • for a test run,
    • show global scores
    • show a list of networks with their score
    • for a network
      • show the full report

Other requirements

High test coverage

Let's try to enhance the code quality and aim at 90% of code coverage