Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Write biblio details import lib #135

Open
dosumis opened this issue Jul 7, 2021 · 7 comments
Open

Write biblio details import lib #135

dosumis opened this issue Jul 7, 2021 · 7 comments
Assignees

Comments

@dosumis
Copy link
Contributor

dosumis commented Jul 7, 2021

(Ticket may not belong here)

We need a generic biblio details import extension for VFB pipeline2. This should use lookups via NCBI pubmed API or DOI to pull general bibliographic information into the integration layer (triplestore).

To investigate - can we follow/re-use some existing standard for this?

@dosumis
Copy link
Contributor Author

dosumis commented Jul 7, 2021

@dosumis
Copy link
Contributor Author

dosumis commented Jul 7, 2021

Also try EuroPMC - might give preprints too

@hkir-dev
Copy link
Contributor

Coverage of the EuroPMC seems good: https://www.ebi.ac.uk/europepmc/webservices/rest/search?query=DOI%3A10.1101%2F2020.02.29.970558&resultType=lite&cursorMark=*&pageSize=25&format=dc

We can re-use Dublin Core response format of this service.

@hkir-dev hkir-dev self-assigned this Jul 14, 2021
@dosumis
Copy link
Contributor Author

dosumis commented Jul 15, 2021

Looks great. I like the idea of following DC response format as standard. Assume importing to triple-store => individual with IRI http://europepmc.org/abstract/PPR/PPR115538 ?

We need to think through how to wire these up to Classes/Individuals - or if we don't, how to index so details can easily be retrieved by indexed ID.

References to publications are commonly found in IRIs on definitions, e.g. see PMID here:

"definition": [
"{"annotations":{"database_cross_reference":["PMID:12162730","MESH:A08.637.200","GOC:dsd","MESH:D001253","GOC:tfm","PMID:11746784","PMID:20942978","http://en.wikipedia.org/wiki/Astrocyte","PMID:12898703"]},"value":"A class of large neuroglial (macroglial) cells in the central nervous system - the largest and most numerous neuroglial cells in the brain and spinal cord. Astrocytes (from 'star' cells) are irregularly shaped with many long processes, including those with 'end feet' which form the glial (limiting) membrane and directly and indirectly contribute to the blood-brain barrier. They regulate the extracellular ionic and chemical environment, and 'reactive astrocytes' (along with microglia) respond to injury."}"
]

May also be found on edges. Given that - maybe an indexed key lookup would be best?

@hkir-dev
Copy link
Contributor

hkir-dev commented Jul 15, 2021

Our current structure is as follows:

    <owl:Axiom>
        <owl:annotatedSource rdf:resource="http://www.semanticweb.org/brain_data_standards/AllenDendClass_CS202002013_1"/>
        <owl:annotatedProperty rdf:resource="http://purl.obolibrary.org/obo/IAO_0000115"/>
        <owl:annotatedTarget rdf:datatype="http://www.w3.org/2001/XMLSchema#string">A GABAergic interneuron ...</owl:annotatedTarget>
        <oboInOwl:hasDbXref rdf:datatype="http://www.w3.org/2001/XMLSchema#string">DOI:10.1101/2020.02.29.970558</oboInOwl:hasDbXref>
        <oboInOwl:hasDbXref rdf:datatype="http://www.w3.org/2001/XMLSchema#string">DOI:10.1101/2020.03.31.016972</oboInOwl:hasDbXref>
    </owl:Axiom>

I was planning to add one more annotation to this axiom using dc:source :

<dc:source rdf:resource="http://europepmc.org/abstract/PPR/PPR115538/>

Do you have a type recommendation for biblio instances (at worst we can use BibliographicResource)?

In the vfb_dumps we can construct an explicit relation (dc:source?) between AllenDendClasses and the related biblio instances.

@dosumis
Copy link
Contributor Author

dosumis commented Jul 16, 2021

Proposal:

Add rdf as returned by EuroPMC API - dc structure, but add an additional triple with value query string.

e.g.

query with https://www.ebi.ac.uk/europepmc/webservices/rest/search?query=doi:10.1101%2F2020.02.29.970558&resultType=lite&cursorMark=*&pageSize=25&format=dc

Add:

http://europepmc.org/abstract/PPR/PPR115538 :DOI 10.1101/2020.02.29.970558 .

Or PMID.

@hkir-dev
Copy link
Contributor

skos:exactMatch relation used for storing query string:

http://europepmc.org/abstract/PPR/PPR115538 skos:exactMatch "DOI:10.1101/2020.02.29.970558"

Implementation of the related feature is completed and PR created
VirtualFlyBrain/vfb-pipeline-collectdata#20

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants