GitHub

In this work we present APNet, a Deep Learning approach with biological explainability to discover predictive drivers of COVID-19 severity focusing on distinct publicly available plasma proteomic datasets generated with the Olink Proximity Extension Assay (PEA) technology. This modular pipeline has the following structure:

APNet initially converts high-throughput omics expression matrices, such as plasma proteomics, into activity matrices that reflect the regulatory strength of transcription factors and signal drivers. This involves Bayesian-based differential expression and activity analysis through NetBID2 for bulk-omics and scMINER for single-cell data, utilizing SJARACNe co-expression graphs.
Then, APNet associates differentially active drivers with biological pathways using the Enrichr Knowledge Graph (KG).
Next, APNet employs PASNet, a deep learning approach, to provide interpretable predictions for patient classification. APNet uses the differentially active drivers and their associated pathways as inputs, applying sparse regularization to hierarchical driver-pathway connections. The interpretability is further improved by including Shapley values (SHAP).
Finally, APNet assembles bipartite driver-pathway networks based on SJARACNe co-expression graphs, incorporating information from earlier modules such as Mutual Information for driver-driver interactions and PASNet predictive weights for driver-pathway connections. These bipartite networks are valuable for investigating mechanistic hypotheses, using graph representation learning, shortest path retrieval and signal propagation simulation. To ensure generalizability and avoid over-fitting, we deployed APNet in three distinct Olink plasma proteomic datasets (>1420 proteins, > 800 cases separating training, validation and testing tasks per dataset) and discovered predictive drivers of severity in COVID-19, confirming biological ground truths but also uncovering nascent information with biological credence. APNet discovered more joint proteomic perturbations than typical differential expression analysis across the three datasets. Also, APNet outperformed state-of-the-art Machine Learning models that had performed predictions on these datasets and alternative iterations of APNet based on differential expression or on Random Forest. Part of the predictive drivers that APNet uncovered were traced to circulating blood cells based on a validation study using activity-transformed single-cell RNA-seq (scRNA-seq) data. Finally, analysis of APNet’s bipartite graphs uncovered information about ACAA1, a highly predictive but relatively unexplored by the literature driver, with syndecan SDC1, the keratin KRT18 and the mitochondrial co-chaperone GRPEL1 reflecting liver damage which emerges in severe cases of COVID-19.

Guidelines to run APNet:

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
configs		configs
data		data
docs		docs
notebooks		notebooks
results		results
supplementary		supplementary
#		#
.gitignore		.gitignore
APNet.Rproj		APNet.Rproj
LICENSE		LICENSE
Place		Place
README.md		README.md
SC1		SC1
file		file
here		here

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Contributors 4

Languages

License

BiodataAnalysisGroup/APNet

Folders and files

Latest commit

History

Repository files navigation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages