Skip to content

A framework that combines data from the UMLS with assertions from other data sources into a set of CSV files that can be imported into neo4j to build a Unified Biomedical Knowledge Graph (UBKG)

License

MIT, MIT licenses found

Licenses found

MIT
LICENSE
MIT
license.txt
Notifications You must be signed in to change notification settings

x-atlas-consortia/ubkg-etl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Unified Biomedical Knowledge Graph (UBKG)

Source and Generation Framework Scripts

Scripts in this repo compose the following components of the UBKG:

Source Framework

The source framework

  • extracts from a data warehouse assertion data from the UMLS
  • formats output as CSVs for import into a neo4j knowledge graph

UMLS data is obtained from releases of the UMLS in Rich Release Format. The University of Pittsburgh's Department of Biomedical Informatics maintains the data warehouse that the source framework uses.

Generation framework

The generation framework is a suite of Extract-Transform-Load (ETL) scripts that

  • extract assertion data from sources, including OWL files
  • appends assertions to the set of UMLS CSVs

Additional information

For more information on the UBKG, consult the GitHub Docs site.

About

A framework that combines data from the UMLS with assertions from other data sources into a set of CSV files that can be imported into neo4j to build a Unified Biomedical Knowledge Graph (UBKG)

Resources

License

MIT, MIT licenses found

Licenses found

MIT
LICENSE
MIT
license.txt

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published