Science Archive

An application providing an API to save, retrieve, and view an observatory's science data. The data files themselves are stored in a FileStore (S3 recommended), while selected metadata for each file is stored in a database for easy querying and filtering. This application relies on the OCS Archive library for configuration of its FileStore and input DataFile formats, and on the OCS Ingester library to ingest file metadata into the archive and upload the files themselves to the configured FileStore.

Prerequisites

Optional prerequisites can be skipped for reduced functionality.

  • Python >= 3.7
  • PostgreSQL with the PostGIS extension installed
  • A FileStore (S3 recommended) with read/write privileges and versioning enabled (if S3)
  • System dependencies to install the psycopg2 package
  • (Optional) RabbitMQ
  • (Optional) Memcached
  • (Optional) Nginx with the mod_zip module serving the archive (needed to support downloading zip files of multiple images at once)

Configuration

Users can authenticate using OAuth. The authentication server for the science archive is the Observation Portal.

This project can be configured to use a single database, or queries can be routed to separate endpoints of a database cluster so that read operations go to replicas and write operations go to the main writer database.
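
For example, a read/write split might be configured like this (the hostnames below are hypothetical placeholders; point both variables at the same host if you are not running a cluster):

export DB_HOST=archive-db-writer.example.com
export DB_HOST_READER=archive-db-reader.example.com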

The FileStore and DataFile format are configured using the environment variables specified in the OCS Archive library, so please review those first and set them appropriately. In particular, the FILESTORE_TYPE environment variable must be set for your FileStore backend, since it defaults to dummy, which stores no data. The header mapping environment variables should also be set to map to the correct keys in your data products. These environment variable values must match those used by the OCS Ingester library to upload and ingest data products. For more information on how to customize your OCS Science Archive, please review the data flow documentation.
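
As a rough sketch, an S3-backed deployment might export something like the following before starting the service; the authoritative variable names and accepted values are defined by the OCS Archive library, so treat these as illustrative:

# Select a real FileStore backend; the default "dummy" stores no data
export FILESTORE_TYPE=s3
# Credentials and region for the bucket (values shown are placeholders)
export AWS_ACCESS_KEY_ID=changeme
export AWS_SECRET_ACCESS_KEY=changeme
export AWS_DEFAULT_REGION=us-west-2
export S3_ENDPOINT_URL=http://s3.us-west-2.amazonaws.com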

This project is configured using environment variables.

| Group | Variable | Description | Default |
|-------|----------|-------------|---------|
| General | SECRET_KEY | Django secret key | random string |
| General | DEBUG | Enable Django debugging mode | False |
| General | CACHE_LOC | Memcached cache location | memcached.archiveapi:11211 |
| Database | DB_HOST | PostgreSQL database hostname for the writer endpoint | 127.0.0.1 |
| Database | DB_HOST_READER | PostgreSQL database hostname for the reader endpoint. Can be set to the same value as DB_HOST if a cluster is not being used. | 127.0.0.1 |
| Database | DB_PORT | Database port to accompany DB_HOST | 5432 |
| Database | DB_NAME | PostgreSQL database name | archive |
| Database | DB_USER | PostgreSQL database username | postgres |
| Database | DB_PASS | PostgreSQL database password | postgres |
| AWS | AWS_ACCESS_KEY_ID | AWS access key ID | empty string |
| AWS | AWS_SECRET_ACCESS_KEY | AWS secret access key | empty string |
| AWS | AWS_DEFAULT_REGION | AWS default region | us-west-2 |
| AWS | S3_ENDPOINT_URL | Endpoint URL for connecting to S3. Can be modified to connect to a local S3 instance. | http://s3.us-west-2.amazonaws.com |
| Post-processing | PROCESSED_EXCHANGE_ENABLED | Enable post-processing. When True, details of a newly ingested image are sent to a RabbitMQ exchange, which is useful for e.g. data pipelines that need to know whenever a new image is available. Set to False to disable. | True |
| Post-processing | QUEUE_BROKER_URL | RabbitMQ broker URL | memory://localhost |
| Post-processing | PROCESSED_EXCHANGE_NAME | Archived FITS exchange name | archived_fits |
| Expire Guide Frames | GUIDE_CAMERAS_TO_PERSIST | Comma-delimited list of guide camera names to exclude from expiring after 1 year | empty string |
| OAuth | OAUTH_CLIENT_ID | OAuth client ID | empty string |
| OAuth | OAUTH_CLIENT_SECRET | OAuth client secret | empty string |
| OAuth | OAUTH_TOKEN_URL | Observation Portal OAuth token URL | http://localhost/o/token/ |
| OAuth | OAUTH_PROFILE_URL | Observation Portal profile URL | http://localhost/api/profile/ |
| OAuth | OAUTH_SERVER_KEY | Observation Portal server secret key used to authenticate calls from the server (should match the value in the Observation Portal deployment) | empty string |
| Configuration Types | CONFIGURATION_TYPES | Comma-delimited list of configuration types to use for validation and forms. Only used if CONFIGDB_URL is not set. | BIAS,DARK,EXPOSE,SPECTRUM,LAMPFLAT,SKYFLAT |
| Configuration Types | CONFIGDB_URL | Configuration database URL. If set, it is used to retrieve the available configuration types. | empty string |
| Appearance Settings | NAVBAR_TITLE_TEXT | Name that appears in the navbar of the browsable API | Science Archive API |
| Appearance Settings | NAVBAR_TITLE_URL | Hyperlink for the NAVBAR_TITLE_TEXT | https://archive.lco.global |
| Appearance Settings | PAGINATION_DEFAULT_LIMIT | Default page size for results | 100 |
| Appearance Settings | PAGINATION_MAX_LIMIT | Maximum page size that can be requested by the client | 1000 |
| More Customization | ZIP_DOWNLOAD_FILENAME_BASE | Initial part of the zip download filename | ocs_archive_data |
| More Customization | ZIP_DOWNLOAD_MAX_UNCOMPRESSED_FILES | Maximum number of files that users can bundle in a single uncompressed zip download | 10 |
| More Customization | TERMS_OF_SERVICE_URL | URL pointing to a terms of service for users of the observatory | https://lco.global/policies/terms/ |
| More Customization | DOCUMENTATION_URL | URL pointing to user-facing documentation | https://observatorycontrolsystem.github.io/api/science_archive/ |

Local Development

Set up the S3 bucket

Please refer to the S3 documentation for how to set up a bucket with read/write access.
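
If you manage your buckets with the AWS CLI, a bucket with versioning enabled (required per the prerequisites above) can be created along these lines; the bucket name and region are placeholders:

aws s3api create-bucket --bucket my-archive-bucket --region us-west-2 --create-bucket-configuration LocationConstraint=us-west-2
aws s3api put-bucket-versioning --bucket my-archive-bucket --versioning-configuration Status=Enabled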

Using a virtual environment is highly recommended. Run the following commands from the base of this project. (env) is used to denote commands that should be run using your virtual environment. Note that the system dependencies of the psycopg2 PyPI package must be installed at this point.

python3 -m venv env
source env/bin/activate
(env) pip install -r requirements.txt

Set up the database

This example uses a PostgreSQL Docker image that already has PostGIS installed. Make sure that the options you use to set up your database match your configured database settings.

docker run --name archive-postgres -e POSTGRES_PASSWORD=postgres -e POSTGRES_DB=archive -p5432:5432 -d mdillon/postgis

After creating the database, migrations must be applied to set up the tables in the database.

(env) python manage.py migrate

Run the tests

(env) python manage.py test --settings=test_settings

Run the science archive

(env) python manage.py runserver

The science archive should now be accessible at http://127.0.0.1:8000
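
As a quick smoke test, you can query the API from the command line; the /frames/ endpoint shown here assumes the science archive's default routes:

curl http://127.0.0.1:8000/frames/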

Adding data

Only superusers can ingest data into the science archive. To create a superuser, run the following command and follow the prompts:

(env) python manage.py createsuperuser

Obtain the resulting authentication token, which can then be used by the ingester:

(env) python manage.py shell_plus
In [1]: User.objects.first().auth_token
Out[1]: <Token: 48d03ec62ce69fef68bd545a751ccb1efef689a5>
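
This token can then be supplied to the OCS Ingester's configuration (see its documentation for the exact setting names). To verify the token directly against the API, pass it in an Authorization header; the example below assumes the standard Django REST framework token scheme and reuses the token value from the shell session above:

curl -H "Authorization: Token 48d03ec62ce69fef68bd545a751ccb1efef689a5" http://127.0.0.1:8000/frames/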